This page gives you Search Results detail for the Application 10729895 and Search Result 20070112_180203_us-10-729-895a-2.rng.


GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 

OM protein - nucleic search, using frame_plus_p2n model

Run on: January 16, 2007, 04:55:12; Search time 621 Seconds
(without alignments)
5759.684 Million cell updates/sec

Title: US-10-729-895A-2 
Perfect score: 1909 

Sequence: 1 MPFLLGLRQDKEACVGTNNQ...LLNTINEQDSPNSQSSSSPS 342

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 5244920 seqs, 3486124231 residues 

Total number of hits satisfying chosen parameters: 10489840 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries
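
The throughput and hit count reported in this header can be cross-checked from the other header figures. The sketch below rests on two assumptions inferred from the numbers rather than stated by the report: each of the 342 query residues is compared as a three-nucleotide codon, and every database sequence is counted once per strand. The function and constant names are illustrative only.

```python
# Cross-check of the header figures. The factor of 3 (codon width) and the
# factor of 2 (both strands) are assumptions inferred from the numbers,
# not documented GenCore behaviour.
QUERY_LENGTH_AA = 342
DB_SEQUENCES = 5_244_920
DB_RESIDUES = 3_486_124_231
SEARCH_SECONDS = 621


def million_cell_updates_per_sec(query_aa, db_residues, seconds):
    """Matrix cells (query codon positions x database residues) per second, in millions."""
    cells = query_aa * 3 * db_residues
    return cells / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH_AA, DB_RESIDUES, SEARCH_SECONDS)
hits = 2 * DB_SEQUENCES  # one candidate hit per sequence per strand

print(f"{rate:.3f} million cell updates/sec")
print(hits, "hits")
```

Under these assumptions both values agree with the header (5759.684 million cell updates/sec and 10489840 hits).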

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/abss/ABSSWEB_spool/US10729895/runat_12012007_180203_7306/app_query.fasta_1
-DB=N_Geneseq -QFMT=fastap -SUFFIX=rng -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss03p
-USER=US10729895_@CGN_1_1_1147_@runat_12012007_180203_7306 -NCPU=6 -ICPU=3
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7
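
The parameter listing above is a flat run of `-KEY=VALUE` options and bare `-FLAG` switches. A minimal sketch of reading such a listing into a dictionary follows; `parse_gencore_params` is a hypothetical helper written for illustration, not part of GenCore.

```python
def parse_gencore_params(text):
    """Split '-KEY=VALUE' options and bare '-FLAG' switches into a dict.

    Values are kept as strings; bare switches map to True.
    """
    params = {}
    for token in text.split():
        if not token.startswith("-"):
            continue  # skip anything that is not an option token
        # Only the first '=' separates key from value, so '-END=-1' works.
        key, sep, value = token[1:].partition("=")
        params[key] = value if sep else True
    return params


sample = "-MODEL=frame+_p2n.model -END=-1 -NO_MMAP -XGAPOP=10 -XGAPEXT=0.5"
parsed = parse_gencore_params(sample)
print(parsed)
```

Splitting on whitespace first and only then on the first `=` keeps negative values such as `-END=-1` intact.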



Database : N_Geneseq_8:*
1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002as:*
7: geneseqn2002bs:*
8: geneseqn2003as:*
9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004as:*
13: geneseqn2004bs:*
14: geneseqn2005s:*
15: geneseqn2006s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
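
In the summary table that follows, the % Query Match column is consistent with the hit score divided by the perfect score (1909 for this query). This is a pattern observed in the listed values, not a documented GenCore formula, and the function name below is illustrative.

```python
PERFECT_SCORE = 1909  # "Perfect score" from the header of this report


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score, rounded to one decimal."""
    return round(100.0 * score / perfect_score, 1)


for score in (1909, 1901, 1353, 918, 474.5):
    print(score, query_match_percent(score))
```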

SUMMARIES

                    %
 Result           Query
    No.   Score   Match  Length  DB  ID          Description
-------------------------------------------------------------------
     1   1909     100.0    1080  12  ADP48574    ?
     2   1909     100.0    2976   6  AAI72319    ?
     3   1909     100.0    4122  12  ADP48589    ?
     4   1901      99.6    2703  10  ADA53255    ?
  5-11   [rows only partly legible; fragments in source order:
          97.9, DB 12; 97.6, AAS44951 cDNA enco; 97.6, Human sec;
          1846, 96.?; 1805, 94.6, 1715, 5, AAS45139;
          86.?, 12, ADP28830 Human sec]
    12   1353      70.9    4745   4  AAK65958    ?
    13      ?      70.7    4744   4  AAK65959    Aak65959 Human imm
    14   1294      67.8   69081  11  ACN44642    Acn44642 Human gen
    15   1178.5    61.7     832   4  AAD07857    Aad07857 Human sec
    16      ?      49.0    1006   5  ABA21206    Aba21206 Human ner
c   17      ?      48.1     505   4  AAI17727    Aai17727 Probe #?
c   18      ?      48.1     505   4  ABA62668    Aba62668 Human foe
c   19      ?      48.1     505   4  AAI42669    Aai42669 Probe #11
c   20    918      48.1     505   4  ABA29977    Aba29977 Probe #84
c   21    918      48.1     505   4  AAK36874    Aak36874 Human bon
c   22    918      48.1     505   4  AAK11046    Aak11046 Human bra
c   23    918      48.1     505   4  ABS36348    Abs36348 Human liv
c   24    918      48.1     505   6  ABS10887    Abs10887 Human gen
c   25    753      39.4   84973   ?  ADH69807_6  Continuation (7 of
    26      ?      39.3     439   5  ADL40673    Adl40673 Human ova
    27      ?      39.?     577   ?  ABZ36387    Abz36387 Human sec
    28      ?      37.?       ?   ?  AAD07856    Aad07856 Human sec
    29      ?         ?       ?   ?  ?           ? Human ?
    30      ?         ?       ?   ?  ADQ53832    Adq53832 Novel can
    31      ?         ?       ?  13  ADU02259    Adu02259 Novel hum
    32      ?      33.?       ?   4  AAI26927    Aai26927 Probe #16
    33      ?      33.?       ?   4  ABA75183    Aba75183 Human foe
    34      ?         ?       ?   4  AAI55731    Aai55731 Probe #24
c   35      ?      33.?     352   4  ABA39851    Aba39851 Probe #18
    36      ?         ?     352   ?  AAK49818    Aak49818 Human bon
    37      ?         ?     352   ?  AAK23717    Aak23717 Human bra
    38      ?      33.?     352   4  ABS49458    Abs49458 Human liv
c   39    640      33.5     352   6  ABS23320    Abs23320 Human gen
    40    631      33.1     382   5  ADI75437    Adi75437 Human ova
    41    631      33.1     382   5  ADI69090    Adi69090 Human ova
    42    579      30.3     462   9  ACH22664    Ach22664 Human adu
    43    476.5    25.0    1183  14  ADY18021    Ady18021 DNA encod
    44    476.5    25.0    1199   3  AAC77716    Aac77716 Human can
    45    474.5    24.9    1024   3  AAA60194    Aaa60194 Hydrophob

(? marks a digit or field that is not legible in the source.)



ALIGNMENTS 


RESULT 1
ADP48574
ID ADP48574 standard; DNA; 1080 BP.
XX
AC ADP48574;
XX
DT 09-SEP-2004 (first entry)
XX
DE Human OPAL1/G0 DNA splice form incorporating exon 1 SeqID 1.
XX
KW human; gene; ds; OPAL1; G0; childhood leukaemia; G protein beta 2; G1;
KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1;
KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia;
KW AML.
XX
OS Homo sapiens.
XX
FH Key             Location/Qualifiers
FT CDS             1..1029
FT                 /*tag= a
FT                 /product= "OPAL1/G0 protein (incorporating exon 1)"
XX
PN WO2004053074-A2.
XX
PD 24-JUN-2004.
XX
PF 05-DEC-2003; 2003WO-US038738.
XX
PR 06-DEC-2002; 2002US-0432064P.
PR 06-DEC-2002; 2002US-0432077P.
PR 06-DEC-2002; 2002US-0432078P.
PR 14-OCT-2003; 2003US-0510904P.
PR 14-OCT-2003; 2003US-0510968P.
PR 05-DEC-2003; 2003US-00729895.
PR 05-DEC-2003; 2003US-0527610P.
XX
PA (SCTE-) SCI & TECHNOLOGY CORP @UNM.
PA (SAND-) SANDIA CORP.
XX
PI Willman CL, Helman P, Veroff R, Mosquera-Caro M, Davidson GS;
PI Martin SB, Atlas SR, Andries E, Kang H, Shuster JJ, Haaland DM;
PI Potter JW, Wang X, Harvey R;
XX
DR WPI; 2004-468846/44.
DR P-PSDB; ADP48575.
XX
PT New isolated Outcome Predictor in Acute Leukemia 1 (OPAL1)
PT polynucleotides and polypeptides for preventing, treating or diagnosing
PT leukemia, or for screening or evaluating compounds that may be used for
PT treating leukemia.
XX
PS Claim 1; SEQ ID NO 1; 343pp; English.
XX
CC This invention relates to a novel isolated nucleic acid identified as
CC OPAL1 (also known as G0), and the encoded protein thereof. Specifically,
CC it refers to a method for predicting the outcome and risk classification
CC of childhood leukaemia. The present invention describes a gene profiling
CC method that can be used to determine the expression level for OPAL1, a
CC gene that is strongly predictive regarding the outcome of both acute
CC lymphoblastic leukaemia (ALL) and acute myeloid leukaemia (AML).
CC Furthermore, it provides additional genes namely G protein beta 2
CC (referred to as G1), interleukin-10 (IL-10) receptor alpha (referred to
CC as G2), the FYN-binding protein and PBK1 that can also be used, alone or
CC in combination, in predictive assays for the classification and potential
CC therapy for a leukaemia patient. Accordingly, these compositions exhibit
CC cytostatic activities. This polynucleotide sequence is the human OPAL1/G0
CC splice form incorporating exon 1 of the invention.
XX
SQ Sequence 1080 BP; 249 A; 341 C; 292 G; 198 T; 0 U; 0 Other;
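
The base counts on a sequence summary line should add up to the stated length, which gives a quick way to catch transcription errors in such a line. The sketch below parses one line and checks the sum; `parse_sq_line` is a hypothetical helper, and the regular expressions assume the `N BP; n A; n C; ...` layout seen in this report.

```python
import re


def parse_sq_line(line):
    """Return (length, counts) from a 'Sequence N BP; n A; n C; ...' line."""
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {
        base: int(n)
        for n, base in re.findall(r"(\d+)\s+(A|C|G|T|U|Other);", line)
    }
    return length, counts


line = "Sequence 1080 BP; 249 A; 341 C; 292 G; 198 T; 0 U; 0 Other;"
length, counts = parse_sq_line(line)
print(length, counts, sum(counts.values()) == length)
```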



Alignment Scores:
Pred. No.:              1.64e-104   Length:        1080
Score:                  1909.00     Matches:       342
Percent Similarity:     100.0%      Conservative:  0
Best Local Similarity:  100.0%      Mismatches:    0
Query Match:            100.0%      Indels:        0
DB:                     12          Gaps:          0



US-10-729-895A-2 (1-342) x ADP48574 (1-1080) 



Qy    1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60

Qy   21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120

Qy   41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180

Qy   61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240

Qy   81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300

Qy  101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360

Qy  121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420

Qy  141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480

Qy  161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540

Qy  181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600

Qy  201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660

Qy  221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720

Qy  241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780

Qy  261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840

Qy  281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900

Qy  301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960

Qy  321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020

Qy  341 ProSer 342
        ||||||
Db 1021 CCCAGC 1026
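
Each Db line in a protein-nucleic alignment is read three bases at a time against the Qy residues above it. The sketch below translates the first and last Db segments of Result 1 and compares them with the query. It assumes the standard genetic code; the run itself used the table named by -TRANS=human40.cdi, whose contents are not shown in this report, and the helper names are illustrative.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


db_first = "ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA"
db_last = "CCCAGC"
query_start = "MPFLLGLRQDKEACVGTNNQ"  # residues 1-20 of US-10-729-895A-2

print(translate(db_first) == query_start)
print(translate(db_last))
```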





RESULT 2 
AAI72319
ID AAI72319 standard; cDNA; 2976 BP.
XX
AC AAI72319;
XX
DT 15-APR-2002 (first entry)
XX
DE ISIGP-2 cDNA.
XX
KW Human; intracellular signalling protein; ISIGP; gene; cell proliferation;
KW autoimmune; inflammation; gastrointestinal disorder;
KW reproductive disorder; developmental disorder; ss.
XX
OS Homo sapiens.
XX
FH Key             Location/Qualifiers
FT CDS             479..1507
FT                 /*tag= a
FT                 /product= "ISIGP-2"
XX
PN WO200194391-A2.
XX
PD 13-DEC-2001.
XX
PF 07-JUN-2001; 2001WO-US018595.
XX
PR 08-JUN-2000; 2000US-0210582P.
PR 16-JUN-2000; 2000US-0212443P.
XX
PA (INCY-) INCYTE GENOMICS INC.
XX
PI Yue H, He A, Nguyen DB, Yao MG, Bandman O, Burford N, Tang YT;
PI Xu Y, Hafalia A, Azimzai Y, Walia NK;
XX
DR WPI; 2002-154564/20.
DR P-PSDB; AAB47872.
XX
PT New human intracellular signaling protein and polynucleotides useful for
PT diagnosing, treating or preventing cell proliferative,
PT autoimmune/inflammatory, gastrointestinal, reproductive and developmental
PT disorders.
XX
PS Claim 5; Page 102-03; 106pp; English.
XX
CC The sequences given in AAI72318-22 encode novel human intracellular
CC signalling proteins (ISIGP). The polynucleotides and ISIGP proteins may
CC be used for the diagnosis, treatment or prevention of cell proliferative,
CC autoimmune/inflammatory, gastrointestinal, reproductive and developmental
CC disorders. The protein encoded by this sequence has homology to human WW
CC domain binding protein-1
XX
SQ Sequence 2976 BP; 633 A; 817 C; 823 G; 703 T; 0 U; 0 Other;



Alignment Scores:
Pred. No.:              4.81e-104   Length:        2976
Score:                  1909.00     Matches:       342
Percent Similarity:     100.0%      Conservative:  0
Best Local Similarity:  100.0%      Mismatches:    0
Query Match:            100.0%      Indels:        0
DB:                     6           Gaps:          0



US-10-729-895A-2 (1-342) x AAI72319 (1-2976) 



Qy    1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  479 [sequence not legible in source] 538

Qy   21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy   41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  599 [sequence not legible in source] 658

Qy   61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy   81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy  101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy  121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy  141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy  161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy  181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy  201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy  221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy  241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy  261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy  281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378

Qy  301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1379 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1438

Qy  321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1439 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1498

Qy  341 ProSer 342
        ||||||
Db 1499 CCCAGC 1504
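
Because this alignment has no indels or gaps, the Db coordinate of any query residue follows directly from the Db start position: each residue occupies three bases. The sketch below shows that arithmetic; it assumes an ungapped alignment, and `db_codon_span` is a hypothetical helper name.

```python
def db_codon_span(query_pos, db_start):
    """Nucleotide range (1-based, inclusive) of the codon aligned to query_pos.

    Valid only for an ungapped alignment (Indels: 0, Gaps: 0).
    """
    first = db_start + 3 * (query_pos - 1)
    return first, first + 2


# AAI72319: the alignment starts at Db position 479.
print(db_codon_span(1, 479))
print(db_codon_span(341, 479))
print(db_codon_span(342, 479))
```

The last aligned base (1504) falls three bases short of the end of the annotated CDS (479..1507), which would be consistent with a stop codon that the protein query does not cover.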

RESULT 3 
ADP48589
ID ADP48589 standard; cDNA; 4122 BP.
XX
AC ADP48589;
XX
DT 09-SEP-2004 (first entry)
XX
DE Full length human OPAL1 cDNA SeqID 16.
XX
KW human; ss; OPAL1; G0; childhood leukaemia; G protein beta 2; G1;
KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1;
KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia;
KW AML.
XX
OS Homo sapiens.
XX
PN WO2004053074-A2.
XX
PD 24-JUN-2004.
XX
PF 05-DEC-2003; 2003WO-US038738.
XX
PR 06-DEC-2002; 2002US-0432064P.
PR 06-DEC-2002; 2002US-0432077P.
PR 06-DEC-2002; 2002US-0432078P.
PR 14-OCT-2003; 2003US-0510904P.
PR 14-OCT-2003; 2003US-0510968P.
PR 05-DEC-2003; 2003US-00729895.
PR 05-DEC-2003; 2003US-0527610P.
XX
PA (SCTE-) SCI & TECHNOLOGY CORP @UNM.
PA (SAND-) SANDIA CORP.
XX
PI Willman CL, Helman P, Veroff R, Mosquera-Caro M, Davidson GS;
PI Martin SB, Atlas SR, Andries E, Kang H, Shuster JJ, Haaland DM;
PI Potter JW, Wang X, Harvey R;
XX
DR WPI; 2004-468846/44.
XX
PT New isolated Outcome Predictor in Acute Leukemia 1 (OPAL1)
PT polynucleotides and polypeptides for preventing, treating or diagnosing
PT leukemia, or for screening or evaluating compounds that may be used for
PT treating leukemia.
XX
PS Disclosure; SEQ ID NO 16; 343pp; English.
XX
CC This invention relates to a novel isolated nucleic acid identified as
CC OPAL1 (also known as G0), and the encoded protein thereof. Specifically,
CC it refers to a method for predicting the outcome and risk classification
CC of childhood leukaemia. The present invention describes a gene profiling
CC method that can be used to determine the expression level for OPAL1, a
CC gene that is strongly predictive regarding the outcome of both acute
CC lymphoblastic leukaemia (ALL) and acute myeloid leukaemia (AML).
CC Furthermore, it provides additional genes namely G protein beta 2
CC (referred to as G1), interleukin-10 (IL-10) receptor alpha (referred to
CC as G2), the FYN-binding protein and PBK1 that can also be used, alone or
CC in combination, in predictive assays for the classification and potential
CC therapy for a leukaemia patient. Accordingly, these compositions exhibit
CC cytostatic activities. This polynucleotide sequence is the full length
CC human OPAL1 cDNA of the invention.
XX
SQ Sequence 4122 BP; 964 A; 1066 C; 1075 G; 1017 T; 0 U; 0 Other;
Alignment Scores:
Pred. No.:              6.81e-104   Length:        4122
Score:                  1909.00     Matches:       342
Percent Similarity:     100.0%      Conservative:  0
Best Local Similarity:  100.0%      Mismatches:    0
Query Match:            100.0%      Indels:        0
DB:                     12          Gaps:          0

US-10-729-895A-2 (1-342) x ADP48589 (1-4122) 

Qy    1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy   21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy   41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy   61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy   81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy  101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy  121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy  141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy  161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy  181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy  201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy  221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy  241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy  261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy  281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy  301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1045 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1104

Qy  321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy  341 ProSer 342
        ||||||
Db 1165 CCCAGC 1170



RESULT 4 
ADA53255
ID ADA53255 standard; cDNA; 2703 BP.
XX
AC ADA53255;
XX
DT 20-NOV-2003 (first entry)
XX
DE Human coding sequence, SEQ ID 823.
XX
KW Cytostatic; Anti-inflammatory; Osteopathic; Neuroprotective; Nootropic;
KW Gene Therapy; human; secretory protein; membrane proteins; cancer;
KW inflammatory disease; osteoporosis; neurological disease; gene; ss.
XX
OS Homo sapiens.
XX
PN EP1293569-A2.
XX
PD 19-MAR-2003.
XX
PF 21-MAR-2002; 2002EP-00006586.
XX
PR 14-SEP-2001; 2001JP-00328381.
PR 24-JAN-2002; 2002US-0350435P.
XX
PA (HELI-) HELIX RES INST.
PA (REAS-) RES ASSOC BIOTECHNOLOGY.
XX
PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S;
PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Irie R, Tamechika I;
PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y;
XX
DR WPI; 2003-395539/38.
DR P-PSDB; ADA54894.
XX
PT New polynucleotides encoding full-length polypeptides, e.g. secretory
PT and/or membrane proteins, useful for developing medicines for diseases in
PT which the gene is involved, or as target molecules for gene therapy.
XX
PS Claim 1; SEQ ID NO 823; 205pp; English.
XX
CC The present invention relates to novel human secretory or membrane
CC proteins (ADA54072-ADA55710) and their coding sequences (ADA52433-
CC ADA54071). The coding sequences are useful in the gene therapy of
CC diseases caused by abnormalities of the proteins, e.g. cancer,
CC inflammatory diseases, osteoporosis or neurological disease.
XX
SQ Sequence 2703 BP; 599 A; 742 C; 713 G; 649 T; 0 U; 0 Other;
Alignment Scores:
Pred. No.:              1.3e-103    Length:        2703
Score:                  1901.00     Matches:       340
Percent Similarity:     99.7%       Conservative:  1
Best Local Similarity:  99.4%       Mismatches:    1
Query Match:            99.6%       Indels:        0
DB:                     10          Gaps:          0



US-10-729-895A-2 (1-342) x ADA53255 (1-2703) 



Qy      1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy     41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy     61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy     81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy    221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy    241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
          |||   |||||||||||||||||||||||||||||||||||||||||||||||||||:::
Db   1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy    321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy    341 ProSer 342
          ||||||
Db   1165 CCCAGC 1170



RESULT 5 
ADP48576 

ID ADP48576 standard; DNA; 1140 BP. 
XX 

AC ADP48576;
XX 

DT 09-SEP-2004 (first entry) 
XX 

DE Human OPAL1/G0 DNA splice form incorporating exon 1a SeqID 3.
XX

KW human; gene; ds; OPAL1; G0; childhood leukaemia; G protein beta 2; G1;

KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1;

KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia;

KW AML.

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1092

FT /*tag= a

FT /product= "OPAL1/G0 protein (incorporating exon 1a)"
XX 

PN WO2004053074-A2. 
XX 

PD 24-JUN-2004. 
XX 

PF 05-DEC-2003; 2003WO-US038738.
XX

PR 06-DEC-2002; 2002US-0432064P.

PR 06-DEC-2002; 2002US-0432077P.

PR 06-DEC-2002; 2002US-0432078P.

PR 14-OCT-2003; 2003US-0510904P.

PR 14-OCT-2003; 2003US-0510968P.

PR 05-DEC-2003; 2003US-00729895.

PR 05-DEC-2003; 2003US-0527610P.
XX 

PA (SCTE-) SCI & TECHNOLOGY CORP @UNM.

PA (SAND-) SANDIA CORP.

XX 

PI Willman CL, Helman P, Veroff R, Mosquera-Caro M, Davidson GS; 

PI Martin SB, Atlas SR, Andries E, Kang H, Shuster JJ, Haaland DM; 

PI Potter JW, Wang X, Harvey R; 
XX 

DR WPI; 2004-468846/44. 

DR P-PSDB; ADP48577. 
XX 

PT New isolated Outcome Predictor in Acute Leukemia 1 (OPAL1) 

PT polynucleotides and polypeptides for preventing, treating or diagnosing 

PT leukemia, or for screening or evaluating compounds that may be used for 

PT treating leukemia. 

XX 

PS Claim 1; SEQ ID NO 3; 343pp; English. 
XX 

CC This invention relates to a novel isolated nucleic acid identified as 

CC OPAL1 (also known as G0), and the encoded protein thereof. Specifically,

CC it refers to a method for predicting the outcome and risk classification 
CC of childhood leukaemia. The present invention describes a gene profiling
CC method that can be used to determine the expression level for OPAL1, a
CC gene that is strongly predictive regarding the outcome of both acute
CC lymphoblastic leukaemia (ALL) and acute myeloid leukaemia (AML).
CC Furthermore, it provides additional genes namely G protein beta 2
CC (referred to as G1), interleukin-10 (IL-10) receptor alpha (referred to
CC as G2), the FYN-binding protein and PBK1 that can also be used, alone or
CC in combination, in predictive assays for the classification and potential
CC therapy for a leukaemia patient. Accordingly, these compositions exhibit
CC cytostatic activities. This polynucleotide sequence is the human OPAL1/G0
CC splice form incorporating exon 1a of the invention.
XX 

SQ Sequence 1140 BP; 258 A; 367 C; 314 G; 201 T; 0 U; 0 Other; 
Alignment Scores: 

Pred. No.: 4.7e-102 Length: 1140 

Score: 1868.00 Matches: 334 

Percent Similarity: 100.0% Conservative: 0 

Best Local Similarity: 100.0% Mismatches: 0 

Query Match: 97.9% Indels: 0 

DB: 12 Gaps: 0 
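
The Query Match figures in this report are consistent with the hit score expressed as a percentage of the perfect score (1909 for this query). The following is a minimal sketch of that relationship, not GenCore's own code; the function name `query_match_percent` and the rounding to one decimal place are assumptions made for illustration.

```python
def query_match_percent(score, perfect_score):
    """Return the hit score as a percentage of the perfect (self-match) score.

    Assumption: one-decimal rounding, chosen to mirror how the report prints
    the value; the tool's exact rounding rule is not documented here.
    """
    return round(100.0 * score / perfect_score, 1)


# Scores taken from the Alignment Scores tables in this report.
for accession, score in [("ADA53255", 1901.0), ("ADP48576", 1868.0), ("AAS44951", 1863.0)]:
    print(accession, query_match_percent(score, 1909))
```

With the scores listed in the report this gives 99.6, 97.9 and 97.6, which agree with the printed Query Match values.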

US-10-729-895A-2 (1-342) x ADP48576 (1-1140) 

Qy      9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     88 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 147

Qy     29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    148                                                              207

Qy     49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    208 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 267

Qy     69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    268 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 327

Qy     89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    328 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 387

Qy    109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    388 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 447

Qy    129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    448                                                              507

Qy    149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    508 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 567

Qy    169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    568                                                              627

Qy    189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    628                                                              687

Qy    209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    688 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 747

Qy    229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    748 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 807

Qy    249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    808 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 867

Qy    269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    868 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 927

Qy    289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    928 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 987

Qy    309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    988 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1047

Qy    329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
          ||||||||||||||||||||||||||||||||||||||||||
Db   1048 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1089
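
Each block of these protein-to-nucleic alignments pairs a run of query residues (three-letter codes on the Qy line) with the database codons that encode them (the Db line). The sketch below shows one way to cross-check a block by translating the Db line and comparing it with the Qy line. It assumes the standard genetic code, whereas the search itself used the translation table named by the -TRANS parameter; the helper names `translate`, `to_one_letter` and `mismatches` are hypothetical.

```python
# Standard genetic code, built in TCAG order (an assumption; see lead-in).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def to_one_letter(qy):
    """Convert a run of three-letter residue codes (e.g. 'ProSer') to one-letter codes."""
    return "".join(THREE_TO_ONE[qy[i:i + 3]] for i in range(0, len(qy), 3))


def mismatches(qy, db):
    """Return (position, query residue, translated residue) for every disagreement."""
    pairs = zip(to_one_letter(qy), translate(db))
    return [(n, q, d) for n, (q, d) in enumerate(pairs, start=1) if q != d]


# Final block of the ADP48576 alignment (Qy 329-342, Db 1048-1089).
print(mismatches("GluGlnAspSerProAsnSerGlnSerSerSerSerProSer",
                 "GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC"))
```

For this block the list is empty, in line with the 100.0% Best Local Similarity reported for this hit.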

RESULT 6
AAS44951 

ID AAS44951 standard; cDNA; 1508 BP. 
XX 

AC AAS44951;
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE cDNA encoding novel human secretory protein, Seq ID No 32. 
XX 

KW Human; secreted protein; arthritis; Crohn's disease; sepsis; shock; 

KW ischaemia-reperfusion injury; haematopoiesis; cancer; neuropathy;

KW transgenic animal; Alzheimer's disease; Parkinson's disease; burn; 

KW amyotrophic lateral sclerosis; platelet disorder; thrombocytopenia; 

KW ulcer; osteoporosis; bone degenerative disorder; periodontal disease; 

KW gut protection; lung; liver fibrosis; immune deficiency; infection; 

KW severe combined immunodeficiency; SCID; autoimmune disorder; allergy; 

KW multiple sclerosis; rheumatoid arthritis; diabetes mellitus; asthma; 

KW fertility; analgesic; pain; antigen; ss. 
XX 

OS Homo sapiens. 
XX 

PN WO200166689-A2. 
XX 

PD 13-SEP-2001. 
XX 

PF 05-MAR-2001; 2001WO-US004942.
XX

PR 07-MAR-2000; 2000US-00519705.

PR 19-MAY-2000; 2000US-00574454.

PR 17-JUN-2000; 2000US-00596193.

PR 14-JUL-2000; 2000US-00616847.

PR 19-SEP-2000; 2000US-00665363.

PR 20-OCT-2000; 2000US-00693267.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Xu C, Wehrman T, Ren F, Ma Y, Zhou P;

PI Zhao QA, Yang Y, Drmanac RT, Zhang J, Chen R, Xue AJ, Wang J; 
XX 

DR WPI; 2001-589934/66. 

DR P-PSDB; AAU28051. 
XX 

PT Novel polypeptides and nucleic acids obtained from cDNA libraries 

PT prepared from various human tissues, for diagnosis and treatment of 

PT cancer, neurological, inflammatory, and autoimmune disorders. 
XX 

PS Claim 1; SEQ ID NO 32; 107pp; English. 
XX 

CC The invention relates to novel isolated human secreted polypeptides (I) 

CC and polynucleotides (II). (I) and (II) are useful for treating 

CC inflammatory conditions such as arthritis, nephritis, Crohn's disease, 

CC ischaemia-reperfusion injury, shock, sepsis, immune responses, and is 

CC involved in increasing haematopoiesis, stem cell survival, bone growth 

CC and remodeling. (I), (II) and modulators of (II) are useful for 

CC prophylaxis or treatment of one or more cancers. (II) is also useful for 

CC creating transgenic animals useful for studying the in vivo activities of 

CC the polypeptide as well as for studying modulators of the polypeptides. 

CC (I) induces the proliferation of neural cells and regeneration of nerve 

CC and brain tissue and is useful for the treatment of central and 
CC peripheral nervous system diseases and neuropathies, such as Alzheimer's,

CC Parkinson's disease, Huntington's disease, and amyotrophic lateral 

CC sclerosis. In addition, (I) is involved in chemotactic or chemokinetic 

CC activity, regulation of haematopoiesis and is useful for treating myeloid

CC or lymphoid cell disorders, platelet disorders such as thrombocytopenia 

CC and for regeneration of bone, cartilage, tendon, ligament and/or nerve 

CC tissue growth, and in tissue repair, healing of burns, incisions, ulcers, 

CC for treating osteoporosis, osteoarthritis, bone degenerative disorders, 

CC or periodontal disease. Furthermore, (I) is also useful for gut 

CC protection or regeneration and treatment of lung or liver fibrosis, 

CC reperfusion injury in various tissues, various immune deficiencies and 

CC disorders including severe combined immunodeficiency (SCID), bacterial or

CC fungal infections, autoimmune disorders e.g. multiple sclerosis, 

CC rheumatoid arthritis, diabetes mellitus, myasthenia gravis, allergic 

CC reactions and conditions, such as asthma or other respiratory problems. 

CC In addition, (I) affects biorhythms or circadian cycles of rhythms, 

CC fertility, metabolism, catabolism, anabolism, storage or elimination of 

CC dietary fat, lipid, protein, carbohydrate, vitamins, minerals, provides 

CC analgesic effects or other pain reducing effects, immunoglobulin like 

CC activity and can act as an antigen in a vaccine composition to raise an 

CC immune response. AAS44920-AAS45295 represent novel human secreted protein

CC coding sequences of the invention 

XX 

SQ Sequence 1508 BP; 345 A; 447 C; 423 G; 293 T; 0 U; 0 Other; 



Alignment Scores:

Pred. No.: 1.25e-101 Length: 1508

Score: 1863.00 Matches: 333

Percent Similarity: 99.7% Conservative: 0

Best Local Similarity: 99.7% Mismatches: 1

Query Match: 97.6% Indels: 0

DB: 5 Gaps: 0



US-10-729-895A-2 (1-342) x AAS44951 (1-1508) 

Qy      9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy     29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy     49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy     69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy     89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy    109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy    129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy    149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy    169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy    189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy    209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy    229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy    249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    894 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 953

Qy    269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy    289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
          |||||||||||||||||||||||||||||||||||||||   ||||||||||||||||||
Db   1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy    309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy    329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
          ||||||||||||||||||||||||||||||||||||||||||
Db   1134 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1175

RESULT 7
ADL61901

ID ADL61901 DNA; 3095 BP.
XX

AC ADL61901



SCORE Search Results Details for Application 10729895 and Search Result
20070112_180205_us-10-729-895a-2.rge.


This page gives you Search Results detail for the Application 10729895 and Search Result 
20070112_180205_us-10-729-895a-2.rge. 




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd.



OM protein - nucleic search, using frame_plus_p2n model

Run on: January 16, 2007, 05:01:00 ; Search time 5325 Seconds 

(without alignments) 
6160.568 Million cell updates/sec 

Title: US-10-729-895A-2 
Perfect score: 1909 

Sequence: 1 MPFLLGLRQDKEACVGTNNQ LLNTINEQDSPNSQSSSSPS 342 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0

Delop 6.0 , Delext 7.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 12732272 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries



Command line parameters: 
-MODEL=frame+_p2n.model -DEV=xlp

-Q=/abss/ABSSWEB_spool/US10729895/runat_12012007_180205_7317/app_query.fasta_l
-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss03p
-USER=US10729895_@CGN_l_l_8328_@runat_12012007_180205_7317 -NCPU=6 -ICPU=3
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7



Database : GenEmbl:*

1: gb_env:*

2: gb_pat:*

3: gb_ph:*

4: gb_pl:*

5: gb_pr:*

6: gb_ro:*

7: gb_sts:*

8: gb_sy:*

9: gb_un:*

10: gb_vi:*

11: gb_ov:*

12: gb_htg:*

13: gb_in:*

14: gb_om:*

15: gb_ba:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                       %
Result              Query
   No.     Score    Match  Length       DB  ID            Description

     1      1909    100.0  2976          2  AX337966      AX337966 Sequence
     2      1901     99.6  Z'Uj          2  aa/±4ijy      aa/141j3 Sequence
     3      1901     99.6  Z/UJ          5  AKUDbZoD      ai\UDozod Homo sapi
     4      1863     97.6  lUUz          2  /zuzyu        cy/zuzyu Sequence
     5      1863     97.6  juyo          2  CQ41JU4Z      cy4ioU4z Sequence
     6      1846     96.7  4054          2  BUz/DUld      dUz/duij du Human
     7      1618     84.8  Zz4i          6  bLUzbJby      bCUzbDby Mus muscu
     8      1618     84.8  3921          6  BC058949      BC058949 Mus muscu
     9      1353     70.9  IJI/JJ        5  aljdo/yu      aljjo/yu Human una
    10      1353     70.9  173758        5  AC009144      AC009144 Homo sapi
    11      1294     67.8  62035         5  ACU/oyd/      Acu/dyj/ Homo sapi
    12      1293     67.7  ZZoZbJ       12  acu/yyju      Acu/yyju Homo sapi
    13      1275     66.8  170261        5  AC14/y/4      ACi4/y/4 Pan trogl
    14      1172     61.4  2217          5  AKUUUJ/4      AKUUUo/4 Homo sapi
    15      1159     60.7  275121       12  AC099420      AC099420 Rattus no
c   16    1149.5     60.2  206019        6  AC16186b      AClblobD Mus muscu
    17    1149.5     60.2  216311        6  AC156982      AC156982 Mus muscu
    18    1085.5     56.9  196003       12  AC079419      AC079419 Mus muscu
c   19    1025.5     53.7  74874        12  AC168985      AC168985 Bos tauru
    20    1017.5     53.3  150528       12  AC149242      AC149242 Macaca mu
    21    1017.5     53.3  166044       12  AC1DU/1D      acidU/1d Macaca mu
c   22    1017.5     53.3  176436       12  AC148535      AC148535 Macaca mu
c   23     994.5     52.1  239579       12  AcuyDDy/      AcuyDDy/ Rattus no
c   24     994.5     52.1  247659       12  ACUyDD4o      ACuybD4o Rattus no
    25     yyz.d     51.5  Z4/1by       12  ACIzjUU/      ACizJUU/ Rattus no
c   26       918     48.1  505           2  cyu/1obU      cyu/iobu Sequence
c   27       918     48.1  505           2  CQ102496      CQ102496 Sequence
c   28       918     48.1  505           2  CQ141409      CQ141409 Sequence
c   29       918     48.1  505           2  CQ177047      CQ177047 Sequence
c   30       918     48.1  505           2  CQ224699      CQ224699 Sequence
c   31       918     48.1  505           2  CQ262712      CQ262712 Sequence
c   32       918     48.1  505           2  CQ299773      CQ299773 Sequence
c   33       918     48.1  505           2  CQ336943      CQ336943 Sequence
    34     819.5     42.9  1479         11  BC095609      BC095609 Danio rer
c   35     768.5     40.3  84108        12  AC133385      AC133385 Rattus no
    36     768.5     40.3  166247       12  AC135876      AC135876 Rattus no
c   37     768.5     40.3  218780       12  AC115552      AC115552 Rattus no
    38       753     39.4  28657        12  AC153482      AC153482 Homo sapi
c   39       753     39.4  35944         5  HSTCRB1H      AF009664 Homo sapi
    40       753     39.4  67826         5  AC099395      AC099395 Homo sapi
c   41       753     39.4  161425        5  AC104597      AC104597 Homo sapi
c   42       753     39.4  232650        5  U66061        U66061 Human germl
    43       751     39.3  439           2  CQ407492      CQ407492 Sequence
    44       652     34.2  984           2  CQ716136      CQ716136 Sequence
    45       648     33.9  169309        6  AL731692      AL731692 Mouse DNA




ALIGNMENTS 



RESULT 1 

AX337966 

LOCUS       AX337966                2976 bp    DNA     linear   PAT 09-JAN-2002
DEFINITION  Sequence 7 from Patent WO0194391.
ACCESSION   AX337966
VERSION     AX337966.1  GI:18128677
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1
  AUTHORS   Yue,H., He,A., Nguyen,D.B., Yao,M.G., Bandman,O., Burford,N.,
            Tang,Y.T., Xu,Y., Hafalia,A., Azimzai,Y. and Walia,N.K.
  TITLE     Intracellular signaling proteins
  JOURNAL   Patent: WO 0194391-A 7 13-DEC-2001;
            Incyte Genomics, Inc. (US)
FEATURES             Location/Qualifiers
     source          1..2976
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
                     /note="Incyte ID No: 1478005CB1"
ORIGIN

Alignment Scores:

Pred. No.: 2.84e-128 Length: 2976

Score: 1909.00 Matches: 342

Percent Similarity: 100.0% Conservative: 0

Best Local Similarity: 100.0% Mismatches: 0

Query Match: 100.0% Indels: 0

DB: 2 Gaps: 0



US-10-729-895A-2 (1-342) x AX337966 (1-2976) 



Qy      1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    479 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 538

Qy     21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy     41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    599 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 658

Qy     61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy     81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy    101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy    121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy    141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy    161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy    181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy    201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy    221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy    241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy    261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy    281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378 



301 



320 



GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I t 1 .1 I I I I I 

137 9 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 14 38 

321 CysLeuLeuLeuAsnThrlleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
14 39 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 14 98 



341 ProSer 
MINI 
14 99 CCCAGC 



342 



1504 



RESULT 2
AX714139
LOCUS       AX714139                2703 bp    DNA     linear   PAT 15-APR-2003
DEFINITION  Sequence 823 from Patent EP1293569.
ACCESSION   AX714139
VERSION     AX714139.1  GI:29889067
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1
  AUTHORS   Isogai,T., Sugiyama,T., Otsuki,T., Wakamatsu,A., Sato,H., Ishii,S.,
            Yamamoto,J.I., Isono,Y., Hio,Y., Otsuka,K., Nagai,K., Irie,R.,
            Tamechika,I., Seki,N., Yoshikawa,T., Otsuka,M., Nagahari,K. and
            Masuho,Y.
  TITLE     Full-length cDNAs
  JOURNAL   Patent: EP 1293569-A 823 19-MAR-2003;
            Helix Research Institute (JP); Research Association for
            Biotechnology (JP)
FEATURES             Location/Qualifiers
     source          1..2703
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN



Alignment Scores:

Pred. No.:              9.67e-128    Length:        2703
Score:                  1901.00      Matches:       340
Percent Similarity:     99.7%        Conservative:  1
Best Local Similarity:  99.4%        Mismatches:    1
Query Match:            99.6%        Indels:        0
DB:                     2            Gaps:          0



US-10-729-895A-2 (1-342) x AX714139 (1-2703) 



Qy          1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy         21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy         41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy         61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        325                                                              384

Qy         81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy        101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy        121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy        141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy        161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy        181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy        201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy        221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy        241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy        261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy        281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy        301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
              |||   |||||||||||||||||||||||||||||||||||||||||||||||||||:::
Db       1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy        321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy        341 ProSer 342
              ||||||
Db       1165 CCCAGC 1170



RESULT 3
AK056285
LOCUS       AK056285                2703 bp    mRNA    linear   PRI 20-JAN-2006
DEFINITION  Homo sapiens cDNA FLJ31723 fis, clone NT2RI2006682, weakly similar
            to Human WW domain binding protein-1 mRNA.
ACCESSION   AK056285
VERSION     AK056285.1  GI:16551642
KEYWORDS    oligo capping; fis (full insert sequence).
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1
  AUTHORS   Ota,T., Suzuki,Y., Nishikawa,T., Otsuki,T., Sugiyama,T., Irie,R.,
            Wakamatsu,A., Hayashi,K., Sato,H., Nagai,K., Kimura,K., Makita,H.,
            Sekine,M., Obayashi,M., Nishi,T., Shibahara,T., Tanaka,T.,
            Ishii,S., Yamamoto,J., Saito,K., Kawai,Y., Isono,Y., Nakamura,Y.,
            Nagahari,K., Murakami,K., Yasuda,T., Iwayanagi,T., Wagatsuma,M.,
            Shiratori,A., Sudo,H., Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H.,
            Sugawara,M., Takahashi,M., Kanda,K., Yokoi,T., Furuya,T.,
            Kikkawa,E., Omura,Y., Abe,K., Kamihara,K., Katsuta,N., Sato,K.,
            Tanikawa,M., Yamazaki,M., Ninomiya,K., Ishibashi,T., Yamashita,H.,
            Murakawa,K., Fujimori,K., Tanai,H., Kimata,M., Watanabe,M.,
            Hiraoka,S., Chiba,Y., Ishida,S., Ono,Y., Takiguchi,S., Watanabe,S.,
            Yosida,M., Hotuta,T., Kusano,J., Kanehori,K., Takahashi-Fujii,A.,
            Hara,H., Tanase,T.O., Nomura,Y., Togiya,S., Komai,F., Hara,R.,
            Takeuchi,K., Arita,M., Imose,N., Musashino,K., Yuuki,H., Oshima,A.,
            Sasaki,N., Aotsuka,S., Yoshikawa,Y., Matsunawa,H., Ichihara,T.,
            Shiohata,N., Sano,S., Moriya,S., Momiyama,H., Satoh,N., Takami,S.,
            Terashima,Y., Suzuki,O., Nakagawa,S., Senoh,A., Mizoguchi,H.,
            Goto,Y., Shimizu,F., Wakebe,H., Hishigaki,H., Watanabe,T.,
            Sugiyama,A., Takemoto,M., Kawakami,B., Yamazaki,M., Watanabe,K.,
            Kumagai,A., Itakura,S., Fukuzumi,Y., Fujimori,Y., Komiyama,M.,
            Tashiro,H., Tanigami,A., Fujiwara,T., Ono,T., Yamada,K., Fujii,Y.,
            Ozaki,K., Hirao,M., Ohmori,Y., Kawabata,A., Hikiji,T., Kobatake,N.,
            Inagaki,H., Ikema,Y., Okamoto,S., Okitani,R., Kawakami,T.,
            Noguchi,S., Itoh,T., Shigeta,K., Senba,T., Matsumura,K.,
            Nakajima,Y., Mizuno,T., Morinaga,M., Sasaki,M., Togashi,T.,
            Oyama,M., Hata,H., Watanabe,M., Komatsu,T., Mizushima-Sugano,J.,
            Satoh,T., Shirai,Y., Takahashi,Y., Nakagawa,K., Okumura,K.,
            Nagase,T., Nomura,N., Kikuchi,H., Masuho,Y., Yamashita,R.,
            Nakai,K., Yada,T., Nakamura,Y., Ohara,O., Isogai,T. and Sugano,S.
  TITLE     Complete sequencing and characterization of 21,243 full-length
            human cDNAs
  JOURNAL   Nat. Genet. 36 (1), 40-45 (2004)
   PUBMED   14702039
REFERENCE   2
  AUTHORS   Ishibashi,T., Kanehori,K., Yosida,M., Watanabe,S., Ishida,S.,
            Ono,Y., Hotuta,T., Hiraoka,S., Murakawa,K., Takiguchi,S.,
            Kusano,J., Watanabe,M., Fujimori,K., Tanai,H., Ishida,M.,
            Yamashita,H., Chiba,Y., Sugiyama,T., Irie,R., Otsuki,T., Sato,H.,
            Wakamatsu,A., Ishii,S., Yamamoto,J., Isono,Y., Kawai-Hio,Y.,
            Saito,K., Nishikawa,T., Kimura,K., Matsuo,K., Nakamura,Y.,
            Sekine,M., Kikuchi,H., Kanda,K., Wagatsuma,M., Takahashi-Fujii,A.,
            Oshima,A., Sugiyama,A., Kawakami,B., Suzuki,Y., Sugano,S.,
            Nagahari,K., Masuho,Y., Nagai,K. and Isogai,T.
  TITLE     NEDO human cDNA sequencing project
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 2703)
  AUTHORS   Isogai,T., Otsuki,T. and Sugiyama,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-OCT-2001) Takao Isogai, Helix Research Institute,
            Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
            (E-mail:flj-cdna@nifty.com, Tel:81-438-52-3975, Fax:81-438-52-3986)
COMMENT     NEDO human cDNA sequencing project supported by Ministry of
            Economy, Trade and Industry of Japan; cDNA full insert sequencing:
            Research Association for Biotechnology (RAB); cDNA library
            construction: Helix Research Institute (HRI) (supported by Japan
            Key Technology Center etc.); 5'- & 3'-end one pass sequencing: RAB,
            HRI, and Biotechnology Center, National Institute of Technology and
            Evaluation; clone selection for full insert sequencing: RAB and
            HRI.
FEATURES             Location/Qualifiers
     source          1..2703
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="NT2RI2006682"
                     /cell_line="NT2"
                     /cell_type="teratocarcinoma"
                     /clone_lib="NT2RI2"
                     /note="cloning vector: pME18SFL3
                     mRNA from NT2 neuronal precursor cells treated 2-weeks
                     mitotic inhibitor after 5-weeks retinoic acid (RA)
                     induction.
                     majorly NT2 neuron"
ORIGIN



Alignment Scores:

Pred. No.:              9.67e-128    Length:        2703
Score:                  1901.00      Matches:       340
Percent Similarity:     99.7%        Conservative:  1
Best Local Similarity:  99.4%        Mismatches:    1
Query Match:            99.6%        Indels:        0
DB:                     5            Gaps:          0



US-10-729-895A-2 (1-342) x AK056285 (1-2703) 






Qy          1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy         21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy         41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy         61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy         81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy        101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy        121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy        141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy        161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy        181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy        201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy        221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy        241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy        261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy        281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy        301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
              |||   |||||||||||||||||||||||||||||||||||||||||||||||||||:::
Db       1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy        321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy        341 ProSer 342
              ||||||
Db       1165 CCCAGC 1170





RESULT 4
CQ720290
LOCUS       CQ720290                1002 bp    DNA     linear   PAT 03-FEB-2004
DEFINITION  Sequence 6224 from Patent WO02068579.
ACCESSION   CQ720290
VERSION     CQ720290.1  GI:42281147
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1
  AUTHORS   Venter,C.J., Adams,M.C., Li,P.W. and Myers,E.W.
  TITLE     Kits, such as nucleic acid arrays, comprising a majority of
            human exons or transcripts, for detecting expression and other uses
            thereof
  JOURNAL   Patent: WO 02068579-A 6224 06-SEP-2002;
            PE Corporation (NY) (US)
FEATURES             Location/Qualifiers
     source          1..1002
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN



Alignment Scores:

Pred. No.:              1.77e-125    Length:        1002
Score:                  1863.00      Matches:       333
Percent Similarity:     100.0%       Conservative:  0
Best Local Similarity:  100.0%       Mismatches:    0
Query Match:            97.6%        Indels:        0
DB:                     2            Gaps:          0



US-10-729-895A-2 (1-342) x CQ720290 (1-1002) 



Qy         10 AspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHisCys 29
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1                                                              60

Qy         30 CysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuValTrp 49
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 TGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGG 120

Qy         50 ThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHisArg 69
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 ACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGC 180

Qy         70 LeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHisAsn 89
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        181 CTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCACAAT 240

Qy         90 TyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyrGlu 109
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        241 TACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTATGAG 300

Qy        110 GluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGlnGln 129
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        301 GAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAG 360

Qy        130 GlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAspPro 149
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        361 CAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCC 420

Qy        150 ThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSerThr 169
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        421 ACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACA 480

Qy        170 ArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAlaThr 189
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 AGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACC 540

Qy        190 LysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspProGly 209
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541                                                              600

Qy        210 AlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSerGlu 229
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 GCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAA 660

Qy        230 HisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGlyAsp 249
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGAC 720

Qy        250 SerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGluPhe 269
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 TCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTC 780

Qy        270 AsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHisVal 289
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 AACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTG 840

Qy        290 ArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArgGlu 309
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 CGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAG 900

Qy        310 ProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGlu 329
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 CCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAG 960

Qy        330 GlnAspSerProAsnSerGlnSerSerSerSerProSer 342
              |||||||||||||||||||||||||||||||||||||||
Db        961 CAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 999



RESULT 5
CQ413042
LOCUS       CQ413042                3095 bp    DNA     linear   PAT 23-JAN-2004
DEFINITION  Sequence 20113 from Patent WO0170979.
ACCESSION   CQ413042
VERSION     CQ413042.1  GI:41320823
KEYWORDS
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1
  AUTHORS   Lee,J. and Lillie,J.
  TITLE     Genes, compositions, kits, and method for identification,
            assessment, prevention, and therapy of ovarian cancer
  JOURNAL   Patent: WO 0170979-A 20113 27-SEP-2001;
            Millennium Pharmaceuticals, Inc. (US)
FEATURES             Location/Qualifiers
     source          1..3095
                     /organism="Homo sapiens"
                     /mol_type="unassigned DNA"
                     /db_xref="taxon:9606"
ORIGIN



Alignment Scores:

Pred. No.:              6.34e-125    Length:        3095
Score:                  1863.00      Matches:       333
Percent Similarity:     99.7%        Conservative:  0
Best Local Similarity:  99.7%        Mismatches:    1
Query Match:            97.6%        Indels:        0
DB:                     2            Gaps:          0



US-10-729-895A-2 (1-342) x CQ413042 (1-3095) 



Qy          9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        109 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 168

Qy         29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        169 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 228

Qy         49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        229 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 288

Qy         69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        289 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 348

Qy         89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        349                                                              408

Qy        109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        409                                                              468

Qy        129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        469 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 528

Qy        149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        529 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 588

Qy        169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        589 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 648

Qy        189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        649 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 708

Qy        209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        709 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 768

Qy        229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        769 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 828

Qy        249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        829 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 888

Qy        269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        889 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 948

Qy        289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
              |||||||||||||||||||||||||||||||||||||||   ||||||||||||||||||
Db        949 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1008

Qy        309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1009 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1068

Qy        329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
              ||||||||||||||||||||||||||||||||||||||||||
Db       1069                                            1110



RESULT 6
BD275013
LOCUS       BD275013                4054 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  50 Human Secreted Proteins.
ACCESSION   BD275013
VERSION     BD275013.1  GI:33084781
KEYWORDS    JP 2002542766-A/13.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 4054)
  AUTHORS   Komatsoulis,G., Rosen,C.A. and Ruben,S.M.
  TITLE     50 Human Secreted Proteins
  JOURNAL   Patent: JP 2002542766-A 13 17-DEC-2002;
            Rosen et al
COMMENT     OS   Homo sapiens
            PN   JP 2002542766-A/13
            PD   17-DEC-2002
            PF   09-MAR-2000 JP 2000605624
            PR   12-MAR-1999 US 60/124093, 23-NOV-1999 US 60/166989 PI
            george komatsoulis, craig a rosen, Steven m ruben CC
            FH   Key             Location/Qualifiers.
FEATURES             Location/Qualifiers
     source          1..4054
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"



Alignment Scores:

Pred. No.:              1.46e-123    Length:        4054
Score:                  1846.00      Matches:       333
Percent Similarity:     99.4%        Conservative:  0
Best Local Similarity:  99.4%        Mismatches:    1
Query Match:            96.7%        Indels:        1
DB:                     2            Gaps:          0



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



US-10-729-895A-2 (1-342) x BD275013 (1-4054) 

9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28 



107 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 166 

29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48 
I I I I 1 1 I I t I I I I I I Ml I I 1 1 I I I 1 1 1 1 I I I I I II I I I I I I 1 I I I I I I 

167 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 226 

49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68 
I I I I 1 1 I I I J I I I I I I M I I 1 1 I I I I I I I I I I I I 1 1 I I 1 1 II I I I II I I I I I I I I I I II I 
227 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 286 

69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88 

I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
287 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 346 

89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProPro-Ty 108 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
347 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTTA 406 

108 rGluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGl 128 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

407 TGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACA 466 

128 nGlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAs 148 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
467 GCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA 526 

148 pProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSe 168 

I I I I 1 I I I I II I I I I I I I I I 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 
527 TCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAG 586 

168 rThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAl 188 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

587 CACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGC 646 

188 aThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPr 208 

I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
647 CACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCC 706 

208 oGlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSe 228 



707 GGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTC 766 

228 rGluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGl 248 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
767 TGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGG 826 

248 yAspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGl 268 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I ! I I I I I I I I I I I I 1 I I I I I I I I I I 
827 TGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGA 886 

268 uPheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHi 288 



887 GTTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCA 946 

288 sValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaAr 308 

I I I I II I I I I I I I I I I I I I I 1 .1 I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
947 TGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCG 1006 
Qy 

Db 
Qy 

Db 



308 gGluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAs 328 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1007 AGAGCCTGGGCACCCGGACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAA 1066 



328 nGluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1067 CGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1109 



RESULT 7
BC026369
LOCUS       BC026369 2241 bp mRNA linear ROD 28-JUL-2005
DEFINITION  Mus musculus DNA segment, Chr 19, Wayne State University 162,
            expressed, mRNA (cDNA clone MGC:31104 IMAGE:4160029), complete cds.
ACCESSION   BC026369
VERSION     BC026369.1 GI:20071329
KEYWORDS    MGC.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1 (bases 1 to 2241)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2 (bases 1 to 2241)
  AUTHORS
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Jeffrey E. Green, M.D.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.

            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 43 Row: c Column: 12

            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 22122608.
FEATURES             Location/Qualifiers
     source          1..2241
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="FVB/N"
                     /db_xref="taxon:10090"
                     /clone="MGC:31104 IMAGE:4160029"
                     /tissue_type="Liver, normal. 5 month old male mouse."
                     /clone_lib="NCI_CGAP_Li9"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2241
                     /gene="D19Wsu162e"
                     /note="synonym: MGC31104"
                     /db_xref="GeneID:226178"
                     /db_xref="MGI:107577"
     CDS             112..1194
                     /gene="D19Wsu162e"
                     /codon_start=1
                     /product="hypothetical protein LOC226178"
                     /protein_id="AAH26369.1"
                     /db_xref="GI:20071330"
                     /db_xref="GeneID:226178"
                     /db_xref="MGI:107577"
                     /translation="MALLLLQALPSPLSVRAEPPQDKEACVGTNNQSYICDTGHCCGQ
                     SQCCNYYYELWWFWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNY
                     SALPFYFRFLPNSLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPP
                     GTDPPPQGSQGAQSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAG
                     HGELDPGAFLDQDSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCN
                     RGHHDDDLKEFNTLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLP
                     RPPACLLLNTINEQDSPNSQHSGSPS"
ORIGIN



Alignment Scores: 
Pred. No.: 2.36e-107 Length: 2241 
Score: 1618.00 Matches: 296 
Percent Similarity: 88.4% Conservative: 15 
Best Local Similarity: 84.1% Mismatches: 31 
Query Match: 84.8% Indels: 10 
DB: 6 Gaps: 4 

US-10-729-895A-2 (1-342) x BC026369 (1-2241) 



Qy 1 MetProPheLeuLeuGlyLeuArg GlnAspLysGluAlaCysValGly 16 

:: : I I I III :: : I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 136 CTGCCCAGCCCCCTGTCAGTGAGGGCTGAGCCCCCGCAGGATAAGGAAGCCTGTGTGGGT 195 

Qy 17 ThrAsnAsnGlnSerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCys 36 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 
Db 196 ACCAACAATCAAAGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGC 255 



Qy 37 AsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSer 56 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I :::::: I I I I I ! I I I I I I . 
Db 256 AACTACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGC 315 

Qy 57 CysCysCysValCysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGln 76 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 316 TGCTGCTGTGTCTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAA 375 

Qy 77 HisGluIleAsnLeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyr 96 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 376 CATGAAATCAACCTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTAC 435 

Qy 97 PheArgPheLeuProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProPro 116 

I I I I I II I I ! I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 436 TTCAGGTTTTTGCCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCA 495 

Qy 117 ThrProProProProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeu 133 

I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 496 ACTCCTCCCCCACCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCT 555 

Qy 134 ProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp ProThrArgGly 152 

III I I I I I I I 1 I I I I I I 1 I II I I I I I I I III ill :::lll 

Db 556 CCTCCTCAGGGTGGCCCTCCAGGTGGCAGCCCCCCAGGCACTGACCCGCCGCCCCAGGGC 615 

Qy 153 SerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSerThrArgProPro 172 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I .1 I I I II I 

Db 616 TCCCAGGGAGCGCAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCA 675 

Qy 173 SerIleAlaAspProAspProSerAspLeuProValAspArgAlaAlaThrLysAlaPro 192 

I I I ::: I I I I I I I I I :::::: | | | I I I I I I I I I I II I I I I I I 

Db 676 AGTGTCGCAGATCCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCT 735 
Qy 193 GlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspProGlyAlaPheLeu 212 
III III ~ : : : M 1 1 1 I 1 1 M 1 1 II 1 1 M M 1 1 1 1 1 II 1 1 1 M I 
Db 736 GGGACGGAGI Ci GGI AGGCCCAIuGCCuGCCACGGGGAGCiGGACCCGt^ 1 ^tHj 795 

Qy 213 AspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGly 231 
1 1 I : : : 1 1 1 : : : 1 1 1 1 1 1 : : : 1 \ 1 1 1 1 II ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
Db 796 GACCAGGATTCCGAGTGCAAGGAGGAGC 1 ACI GAAAGATTCGCGT I Ci GAGCGI GGCCCC 855 

Qy 232 AlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGlyAspSer 250 
::: 1 1 1 1 1 1 1 1 1 :::::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 
Db 856 GTGTCCCCTGACAGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACl LA 915 

Qy 251 GlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGluPheAsn 270 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 916 GGCATTGAGGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAAC 975 

Qy 271 ThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHisValArg 290 
1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 I II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 976 ACGCTC AT AGAT GATGCT CT GGAC GGGC CCC 1 GGAC I 1 CI Gl GACAGCi GiCAi bi ACCu 1035 

Qy 291 ProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArgGluPro 310 
1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 
Db 1036 CCTCCTGTTGATGAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCAT 1095 

Qy 311 GlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGluGln 330 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1! 1 1 1 
Db 1096 GGGCACCCCCACCTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAG 1155 

Qy 331 AspSerProAsnSerGlnSerSerSerSerProSer 342 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 
Db 1156 GACTCTCCAAATTCCCAGCACAGTGGCTCCCCGAGC 1191 





SCORE Search Results Details for Application 
10729895 and Search Result 
20070112_180208_us-10-729-895a-2.rst. 




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 

OM protein - nucleic search, using frame_plus_p2n model 

Run on: January 16, 2007, 05:08:30 ; Search time 4416 Seconds 

(without alignments) 
6496.064 Million cell updates/sec 

Title: US-10-729-895A-2 
Perfect score: 1909 

Sequence: 1 MPFLLGLRQDKEACVGTNNQ LLNTINEQDSPNSQSSSSPS 342 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 
Ygapop 10.0 , Ygapext 0.5 
Fgapop 6.0 , Fgapext 7.0 
Delop 6.0 , Delext 7.0 



Searched: 48236798 seqs, 27959665780 residues 

Total number of hits satisfying chosen parameters: 96473596 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Command line parameters: 
-MODEL=frame+_p2n.model -DEV=xlp 
-Q=/abss/ABSSWEB_spool/US10729895/runat_12012007_180208_7352/app_query.fasta_l 
-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -HOST=abss03p 
-USER=US10729895_@CGN_l_l_9528_@runat_12012007_180208_7352 -NCPU=6 -ICPU=3 
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120 
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database : EST:* 
1: gb_est1:* 
2: gb_est3:* 
3: gb_est4:* 
4: gb_est5:* 
5: gb_est6:* 
6: gb_htc:* 
7: gb_est2:* 
8: gb_est7:* 
9: gb_est8:* 
10: gb_est9:* 
11: gb_gss1:* 
12: gb_gss2:* 
13: gb_gss3:* 
14: gb_gss4:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS 



RESULT 1
CR859701
LOCUS       CR859701 2792 bp mRNA linear HTC 12-NOV-2004
DEFINITION  Pongo pygmaeus mRNA; cDNA DKFZp469N2325
            (from clone DKFZp469N2325).
ACCESSION   CR859701
VERSION     CR859701.1 GI:55730274
KEYWORDS    HTC.
SOURCE      Pongo pygmaeus (orangutan)
  ORGANISM  Pongo pygmaeus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Pongo.
REFERENCE   1 (bases 1 to 2792)
  AUTHORS   Wambutt,R., Heubner,D., Mewes,H.W., Weil,B., Amid,C., Osanger,A.,
            Fobo,G., Han,M. and Wiemann,S.
  CONSRTM   The German cDNA Consortium
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-2004) MIPS, Ingolstaedter Landstr.1, D-85764
            Neuherberg, GERMANY
COMMENT     Clone from S. Wiemann, Molecular Genome Analysis, German Cancer
            Research Center (DKFZ); Email s.wiemann@dkfz-heidelberg.de;
            sequenced by Agowa (Berlin/Germany) within the cDNA sequencing
            consortium of the German Genome Project.

            This clone (DKFZp469N2325) is available at the RZPD Deutsches
            Ressourcenzentrum fuer Genomforschung GmbH in Berlin, Germany.
            Please contact RZPD for ordering:
            http://www.rzpd.de/cgi-bin/products/cl.cgi?CloneID=DKFZp469N2325
            Further information about the clone and the sequencing project is
            available at http://mips.gsf.de/projects/cdna/.
FEATURES             Location/Qualifiers
     source          1..2792
                     /organism="Pongo pygmaeus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9600"
                     /clone="DKFZp469N2325"
                     /tissue_type="kidney"
                     /clone_lib="469 (synonym: pkid1). Vector pSport1_Sfi; host
                     DH10B; sites SfiIA + SfiIB"
                     /dev_stage="adult"
                     /note="hypothetical protein (Mus musculus), differentially
                     spliced"
     gene            1..2792
                     /gene="DKFZp469N2325"
     CDS             43..360
                     /gene="DKFZp469N2325"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAH91860.1"
                     /db_xref="GI:55730275"
                     /translation="MERRRLLGGMALLLLQALPSPLSARAEPPQLFGGASCGSLSCVW
                     LTTPALLVQYWGFQGRIRKPVWVPTIKATSVTQDTAVDSLSAATTTMNSGGSGWCGPS
                     SSS"
ORIGIN



Alignment Scores: 

Pred. No.: 6.11e-133 Length: 2792 

Score: 1847.50 Matches: 331 

Percent Similarity: 98.8% Conservative: 5 

Best Local Similarity: 97.4% Mismatches: 3 

Query Match: 96.8% Indels: 1 

DB: 6 Gaps: 1 
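
The percentage fields in the Alignment Scores blocks appear to follow directly from the counts printed beside them. The sketch below is an inference from the numbers reported in this listing, not a documented GenCore formula: it assumes Query Match is the score divided by the perfect score (1909 for this query), and that both similarity figures use Matches + Conservative + Mismatches + Indels as the denominator. The function name `alignment_percentages` is hypothetical.

```python
def alignment_percentages(score, perfect_score, matches, conservative,
                          mismatches, indels):
    """Recompute the percentage fields of an Alignment Scores block.

    Assumed relationships (inferred from the reported values only):
      Query Match           = score / perfect_score
      Best Local Similarity = matches / aligned_columns
      Percent Similarity    = (matches + conservative) / aligned_columns
    where aligned_columns = matches + conservative + mismatches + indels.
    """
    aligned_columns = matches + conservative + mismatches + indels
    return {
        "query_match": round(100 * score / perfect_score, 1),
        "best_local_similarity": round(100 * matches / aligned_columns, 1),
        "percent_similarity": round(
            100 * (matches + conservative) / aligned_columns, 1
        ),
    }


# Values copied from the CR859701 block (RESULT 1 of the EST search).
result_1 = alignment_percentages(1847.50, 1909, 331, 5, 3, 1)
print(result_1)
```

Under these assumptions the recomputed values agree with the reported 96.8%, 97.4% and 98.8% for CR859701; the same check can be run against the other Alignment Scores blocks to spot OCR damage in the counts.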

US-10-729-895A-2 (1-342) x CR859701 (1-2792) 



Qy 


4 


Db 


201 


Qy 


23 


Db 


261 


Qy 


43 


Db 


321 


Qy 


63 


Db 


381 


Qy 


83 


Db 


441 


Qy 


103 


Db 


501 


Qy 


123 



LeuLeuGlyLeuArg GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyr 22 

: : : I I I I I I ::: II I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I ! I I M I 

GTACTGGGGGTTTCAGGGCAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTAC 260 



I I I i I I I I I I I I I I I I I I I I i I I I I I I I I I I I ! II I It II I I I I i I I I I I I I I I I I I I I I 



320 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 1 I I I I I I I I 



380 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATC 440 



I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
GCCTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAAC 500 



122 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 

TATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATAC 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M 



560 
Db 561 AGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGC 620 

Qy 143 SerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSer 162 
MUM MM Itllll IMIIIIMMlllMltll MINI IIIIIMItl!! 
Db 621 AGTCCCCCGGGCGTTGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCT 680 

Qy 163 GluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSerAspLeu 182 
1 1 1 II 1 1 1 1 1 1 1 1 II Ill MM 1 1 II II II 11 l:::M 1 1 II II II II 
Db 681 GGGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGAGCCCTCTGACCTA 740 

Qy 183 ProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerValAlaGly 202 
II II ll:::| M II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 
Db 741 CCAGTTGAACGAGCAGCCACCAAAGCCCCGGGGATGGAGCCCAGTGGCTCTGTGGCTGGC 800 

Qy 203 LeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeu 222 
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M i 1 
Db 801 CTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAGGATGCAGAATGTAGGGAGGAGCTG 860 

Qy 223 LeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrProGlyArg 242 
1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 II 1 1 1 1 1 1 II II M II 1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 
Db 861 CTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGA 920 

Qy 243 HisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGlyHisHis 262 
II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M M M 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 
Db 921 CATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCAT 980 

Qy 263 AspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAsp 282 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 981 GACGATGACCTCAAAGAGTTCAACACGCTCATCGATGATGCTCTGGATGGGCCCCTGGAC 1040 

Qy 283 PheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCysGlnSer 302 
II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II 1 1 M 1 1 M II II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 1041 TTCTGCGATAGCTGCCATGTGCGGCCCCCTGGTGACGAGGAGGAAGGCCTCTGCCAGCCC 1100 

Qy 303 SerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAlaCysLeu 322 
M M M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! f 1 1 t 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 1101 TCCGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTG 1160 

Qy 323 LeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342 
M M 1 1 M 1 1 1 1 1 1 M 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
Db 1161 CTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1220 




RESULT 2
AK052400
LOCUS       AK052400 2603 bp mRNA linear HTC 02-SEP-2005
DEFINITION  Mus musculus 13 days embryo lung cDNA, RIKEN full-length enriched
            library, clone:D430003C10 product:hypothetical Proline-rich
            region/Cytochrome c family heme-binding site containing protein,
            full insert sequence.
ACCESSION   AK052400
VERSION     AK052400.1 GI:26342634
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
   PUBMED   11076861
REFERENCE   4
  AUTHORS   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   5
  AUTHORS   The FANTOM Consortium, the RIKEN Genome Exploration Research Group
            Phase I and II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6
  AUTHORS   RIKEN Genome Exploration Research Group, Genome Science Group
            (Genome Network Core Team) and the FANTOM Consortium.
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   7
  AUTHORS   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group).
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   8 (bases 1 to 2603)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa, 230-0045, Japan (E-mail:genome-res@gsc.riken.jp,
            URL:http://genome.gsc.riken.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL: http://genome.gsc.riken.jp/
            URL: http://fantom.gsc.riken.jp/
FEATURES             Location/Qualifiers
     source          1..2603
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D430003C10"
                     /db_xref="taxon:10090"
                     /clone="D430003C10"
                     /tissue_type="lung"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="13 days embryo"
     CDS             143..1189
                     /note="unnamed protein product; hypothetical Proline-rich
                     region/Cytochrome c family heme-binding site containing
                     protein (InterPro|IPR000694 InterPro|IPR000345, evidence:
                     InterPro)
                     putative"
                     /codon_start=1
                     /protein_id="BAC34974.1"
                     /db_xref="GI:26342635"
                     /translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
                     FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
                     SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
                     QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ
                     DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN
                     TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN
                     EQDSPNSQHSGSPS"
     polyA_signal    2584..2589
                     /note="putative"
     polyA_site      2603
                     /note="putative"
ORIGIN

Alignment Scores: 

Pred. No.: 1.13e-117 Length: 2603 

Score: 1651.00 Matches: 300 

Percent Similarity: 89.9% Conservative: 13 

Best Local Similarity: 86.2% Mismatches: 29 

Query Match: 86.5% Indels: 6 

DB: 6 Gaps: 3 

US-10-729-895A-2 (1-342) x AK052400 (1-2603) 



Qy 


i 


Db 


143 


Qy 


21 


Db 


203 


Qy 


■41 


Db 


263 


Qy 


61 


Db 


323 


Qy 


81 


Db 


38 3 


Qy 


101 


Db 


443 


Qy 


121 


Db 


503 


Qy 


138 


Db 


563 


Qy 


157 


Db 


623 


Qy 


177 


Db 


68 3 


Qy 


197 


Db 


743 


Qy 


• 217 


Db 


803 


Qy 


235 


Db 


863 


Qy 


255 



MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



20 



202 

SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40 
I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 262 

GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60 
I II I I E I 1 I I I I I I I I N I I I I I I I 1 I I I I :::::: I I I I I I I I I I I I E I I I I I I I I I I I 

GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 322 

CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I 

TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 382 



LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 
I I I I I I I I I I I I I I I I I I I I I I I I I I E I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I IE I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I ::: I I I I I I 



I I 



I I I I I I 



I I I I t I I I I t I I 



i I I 



I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : : I I I : 



100 



120 



CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 502 

ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeu ProProGlnCys 137 

I I I I I I I I I I I I I E I I I I I I II I I I I I I I I I I I III I I I I I I II I 

CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 562 

GlyProAlaGlyGlySerProProGlyIleAsp ProThrArgGlySerGlnGlyAla 156 

I I I I I I I I I I I I I I I I I I 1 I I I I I III III :: : I I I I I I I I I I I I I I I 

GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 622 



682 



802 



234 



GluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGly AlaProAsp 

I I I I I I ::: I I I I I I I I I I I I I I I I I I I I I I I I III : : : I I I I I I 

GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 862



SerLysGluLysThrProGlyArgHisArgArgPheThrGlyAspSerGlyIleGluVal
I I I :::::: I I I I I I I I I I I I I I I I I ) I I I I I I I I I I I I E E I I I I I i I I I I I I I I I I I I I 
AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 



254 



922 
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Db 



923 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 982 



Qy 275 AspAlaLeuAspGlyProLeuAspPheCysAspSerCysHisValArgProProGlyAsp 294

I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I ! I I I I I IM 

Db 983 GATGCTCTGGACGGGCCCCTGGACTTCTGTGACAGCTGTCATGTACGGCCTCCTGTTGAT 1042

Qy 295 GluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArgGluProGlyHisProHis 314 

1 I I I M I I I I 1 I I I I I I I I I I I I I I I I I I I I I I t t I I I I - ! I I I I ! I I I I I I 
Db 1043 GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 1102

Qy 315 LeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsn 334

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 1103 CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 1162

Qy 335 SerGlnSerSerSerSerProSer 342 

I I I I I I III I I I I I I I I I 
Db 1163 TCCCAGCACAGTGGCTCCCCGAGC 1186



RESULT 3 
AK078491 
LOCUS       AK078491                2702 bp    mRNA    linear   HTC 02-SEP-2005
DEFINITION  Mus musculus 12 days embryo female mullerian duct includes
            surrounding region cDNA, RIKEN full-length enriched library,
            clone:6820415N12 product:hypothetical Proline-rich
            region/Cytochrome c family heme-binding site containing protein,
            full insert sequence.
ACCESSION   AK078491
VERSION     AK078491.1  GI:26347312
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
   PUBMED   11076861
REFERENCE   4
  AUTHORS   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   5
  AUTHORS   The FANTOM Consortium, the RIKEN Genome Exploration Research Group
            Phase I and II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6
  AUTHORS   RIKEN Genome Exploration Research Group, Genome Science Group
            (Genome Network Core Team) and the FANTOM Consortium.
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   7
  AUTHORS   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group).
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   8  (bases 1 to 2702)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-APR-2002) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa, 230-0045, Japan (E-mail:genome-res@gsc.riken.jp,
            URL:http://genome.gsc.riken.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.

            Please visit our web site for further details.
            URL: http://genome.gsc.riken.jp/
            URL: http://fantom.gsc.riken.jp/.
FEATURES             Location/Qualifiers
     source          1..2702
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:6820415N12"
                     /db_xref="taxon:10090"
                     /clone="6820415N12"
                     /sex="female"
                     /tissue_type="mullerian duct includes surrounding region"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="12 days embryo"
     CDS             154..1200
                     /note="unnamed protein product; hypothetical Proline-rich
                     region/Cytochrome c family heme-binding site containing
                     protein (InterPro|IPR000694 InterPro|IPR000345, evidence:
                     InterPro)
                     putative"
                     /codon_start=1
                     /protein_id="BAC37305.1"
                     /db_xref="GI:26347313"
                     /translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
                     FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
                     SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
                     QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ
                     DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN
                     TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN
                     EQDSPNSQHSGSPS"
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Alignment Scores: 



Pred. No.:              1.19e-117    Length:        2702
Score:                  1651.00      Matches:       300
Percent Similarity:     89.9%        Conservative:  13
Best Local Similarity:  86.2%        Mismatches:    29
Query Match:            86.5%        Indels:        6
DB:                     6            Gaps:          3
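
The two similarity percentages in this score block are numerically consistent with simple ratios over the alignment columns. The sketch below is an inference from the reported numbers, not a documented GenCore formula: it assumes the number of aligned columns is Matches + Conservative + Mismatches + Indels, that Best Local Similarity counts exact matches only, and that Percent Similarity also counts conservative substitutions. The function name `similarity_stats` is hypothetical.

```python
def similarity_stats(matches, conservative, mismatches, indels):
    """Return (best_local_similarity, percent_similarity) in percent.

    Assumption (inferred from the report, not from GenCore documentation):
    the denominator is the total number of aligned columns.
    """
    columns = matches + conservative + mismatches + indels
    best_local = 100.0 * matches / columns
    percent_similarity = 100.0 * (matches + conservative) / columns
    return round(best_local, 1), round(percent_similarity, 1)


# Values reported for US-10-729-895A-2 x AK078491
print(similarity_stats(matches=300, conservative=13, mismatches=29, indels=6))
```

With the AK078491 values this reproduces the reported 86.2% and 89.9%.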



US-10-729-895A-2 (1-342) x AK078491 (1-2702) 

Qy 



Db 



1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20 
i t II II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
154 ATGCCTTTCCTCTGGGGGCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 213 
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Ov 


21 


SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 


40 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IJ 1 




Db 


214 


AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 


273 


Qy 


41 


GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 


60 






1 1 i 1 1 i 1 1 1 1 I 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :::::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I ! 




Db 


274 


GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 


333 


Qy 


61 


CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 


80 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 




Db 


334 


TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 


393 


Qy 


81 


LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 


100 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


394 


CTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTACTTCAGGTTTTTG 


453 


Qy 


101 


ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 


120 






I 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


454 


CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 


513 


Qy 


121 




137 






1 1 1 1 MINIMI 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 




Db 


514 


CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 


573 


Qy 


138 


GlyProAlaGlyGlySerProProGlyIleAsp ProThrArgGlySerGlnGlyAla 


156 






1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 M 1 1 M 1 1 i 1 III :: : 1 1 II 1 II II 1 1 1 1 1 1 




Db 


574 


GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 


633 


Qy 


157 


GlnSerSerProLeuSerGluProSerArgSerSerThrArgProProSerIleAlaAsp 


176 






111111111111111111 II 1 1 II II 1 1 II 1 1 1 1 1 1 M M II 1 1 M M : : M II M 1 




Db 


634 


CAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCAAGTGTCGCAGAT 


693 


Qy 


177 


ProAspProSerAspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluPro 
Mi :::::: | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 ill III 


196 


Db 


694 


CCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCTGGGACGGAGTCT 


753 


Qy 


197 


SerGlySerValAlaGlyLeuGlyGluLeuAspProGlyAla PheLeuAspLysAspAla 


216 






: : M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M M II 1 II II 1 1 1 II 1 1 : : : 1 1 1 : : : 




Db 


754 


GGTAGCCCCATGGCCGGCCACGGGGAGCTGGACCCGGGCGCCTTCCTGGACCAGGATTCC 


813 


Qy 


217 


GluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGly AlaProAsp 


234 






1 1 1 1 1 1 ::: 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 III : : : 1 1 1 1 1 1 




Db 


814 


GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 


873 


Qy 


235 


SerLysGluLysThrProGlyArgHisArgArgPheThrGlyAspSerGlyIleGluVal 


254 






1 1 1 :::::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


874 


AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 


933 


Qy 


255 


CysValCysAsnArgGlyHisHisAspAspAspLeuLysGluPheAsnThrLeuIleAsp 


274 






1 t 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 M M 1 1 1 1 M M 1 1 II M 1 1 1 1 1 M 1 1 1 M 1 M M 1 1 




Db 


934 


TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 


993 


Qy 


275 


AspAlaLeuAspGlyProLeuAspPheCysAspSerCysHisValArgProProGlyAsp 


294 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II II 1 1 1 1 1 1 II 1 III 




Db 


994 


GATGCTCTGGACGGGCCCCTGGACTTCTGTGACAGCTGTCATGTACGGCCTCCTGTTGAT 


1053 


Qy 


295 


GluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArgGluProGlyHisProHis 


314 






1 1 1 II II II 1 1 1 M II 1 1 Ml 1 M 1 II II 1 1 M 1 1 M M M II II 1 II 1 1 1 




Db 


1054 


GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 


1113 


Qy 


315 


LeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsn 


334 






1 1 1 1 II II II 1 1 1 1 II II 1 1 1 1 1 M II 1 1 M II M M 1 1 1 1 II 1 1 1 1 ! M 1 M M II II 1 




Db 


1114 


CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 


1173 


Qy 


335 


SerGlnSerSerSerSerProSer 342 




Db 


1174 


1 1 i 1 II III 1 1 II 1 1 1 1 1 

TCCCAGCACAGTGGCTCCCCGAGC 1197 
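
Each alignment row above pairs 20 query residues (three-letter codes) with 60 database nucleotides. As a minimal sketch of how one row can be checked by hand, the code below translates the first Db segment of this alignment (positions 154 to 213) and compares it with the corresponding Qy row. It assumes the standard genetic code; the search itself was run with the `human40.cdi` translation table, whose contents are not shown in this report. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the report's own translation table is not reproduced here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring a trailing partial codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def to_one_letter(residues):
    """Convert a run of three-letter residue codes to one-letter codes."""
    return "".join(THREE_TO_ONE[residues[i:i + 3]] for i in range(0, len(residues), 3))


# First row of the US-10-729-895A-2 x AK078491 alignment
qy = "MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln"
db = "ATGCCTTTCCTCTGGGGGCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA"

query = to_one_letter(qy)
subject = translate(db)
identical = sum(q == s for q, s in zip(query, subject))
print(query, subject, identical)
```

The translated segment agrees with the start of the `/translation` qualifier of AK078491 and differs from the query only at residue 5 (Leu in the query, Trp in the database sequence).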




RESULT 4 








AK142695 








LOCUS AK142695 3871 bp mRNA linear HTC 21-SEP-2005


DEFINITION Mus musculus 16 days neonate heart cDNA, RIKEN full-length enriched
library, clone:D830007O22 product:hypothetical protein, full insert
sequence.
ACCESSION AK142695 

VERSION AK142695.1 GI:74216502 
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Submitted (30-MAR-2004 ) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
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RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
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Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome
Encyclopedia Project of Genome Exploration Research Group in Riken
Genomic Sciences Center and Genome Science Laboratory in RIKEN.
Division of Experimental Animal Research in Riken contributed to
prepare mouse tissues.

Please visit our web site for further details.
URL: http://genome.gsc.riken.jp/
URL: http://fantom.gsc.riken.jp/.
FEATURES Location/Qualifiers 
source 1..3871

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="C57BL/6J" 

/db_xref="FANTOM_DB:D830007O22"

/db_xref="taxon:10090"

/clone="D830007O22"

/tissue_type="heart" 

/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="16 days neonate" 
CDS 154..1200

/note="unnamed protein product; hypothetical protein 

(evidence: CDS) 

putative" 

/codon_start=1

/protein_id="BAE25164.1"

/db_xref="GI:74216503"

/translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW 
FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ 
DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN 
TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN 
EQDSPNSQHSGSPS" 

polyA_signal 3846..3851

/note="putative"

polyA_site 3871

/note="putative"
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US-10-729-895A-2 (1-342) x AK142695 (1-3871) 



Qy 


1 


Db 


154 


Qy 


21 


Db 


214 


Qy 


41 


Db 


274 


Qy 


61 


Db 


334 


Qy 


81 


Db 


394 


Qy 


101 


Db 


454 



I I I II I I I II I I I I I I i I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 1 



SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 1 I I 

AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 273 

GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60

III I I I I I I I I I I II I I I I I I I I I I :::::: I I I I I I I I I I I M I I I I I I I I I I I 

GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 333 

CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80 
I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II II I I I I 

TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 393 

LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTACTTCAGGTTTTTG 453

ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I 

CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 513 
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Qy 


121 


Db 


514 


Qy 


138 


Db 


574 


Qy 


157 


Db 


634 


Qy 


177 


Db 


694 


Qy 


197 


Db 


754 


Qy 


217 


Db 


814 


Qy 


235 


Db 


874 


Qy 


255 


Db 


934 


Qy 


275 


Db 


994 


Qy 


295 


Db 


1054 


wy 


■313 


Db 


1114 


Qy 


335 


Db 


1174 



ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeu ProProGlnCys 137 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 111 I I I I I I I I I 

CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 573 

GlyProAlaGlyGlySerProProGlyIleAsp ProThrArgGlySerGlnGlyAla 156

I I I I I I I I I I I I I I I I I II I I I I I III III ::: I I I I II I I I I I I I I I 
GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 633 

GlnSerSerProLeuSerGluProSerArgSerSerThrArgProProSerIleAlaAsp 176

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I ::: I I I I I I 

CAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCAAGTGTCGCAGAT 693

ProAspProSerAspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluPro 196 

III :::::: | | | MINI I I I I I I I I I I I I III III 
CCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCTGGGACGGAGTCT 753



216 



: :: I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I : :: I I I :: : 

GGTAGCCCCATGGCCGGCCACGGGGAGCTGGACCCGGGCGCCTTCCTGGACCAGGATTCC



813 



GluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGly AlaProAsp 234 

I I I I I I :: : I I I I I I I I I I I I I I I I I I I I I I I I III : :: I I I I I I 

GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 873

SerLysGluLysThrProGlyArgHisArgArgPheThrGlyAspSerGlyIleGluVal 254
I I I :::::: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I M I I I I I I I 
AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 933



i I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 993



294 



I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I III 
GATGCTCTGGACGGGCCCCTGGACTTCTGTGACAGCTGTCATGTACGGCCTCCTGTTGAT 



1053 



GluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArgGluProGlyHisProHis 314 
I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 1113



334 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 



1173 



I I I I I I 



t I 



I I I I I I I I 



RESULT 5 
AK154686 
LOCUS       AK154686                3893 bp    mRNA    linear   HTC 21-SEP-2005
DEFINITION  Mus musculus NOD-derived CD11c +ve dendritic cells cDNA, RIKEN
            full-length enriched library, clone:F630103G18 product:hypothetical
            protein, full insert sequence.
ACCESSION   AK154686
VERSION     AK154686.1  GI:74185772
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
   PUBMED   11042159
REFERENCE   3
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AUTHORS Shibata, K. , Itoh,M., Aizawa,K., Nagaoka,S., Sasaki, N., Carninci,P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.

TITLE RIKEN integrated sequence analysis (RISA) system--384-format
sequencing pipeline with 384 multicapillary sequencer

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 
PUBMED 11076861 
REFERENCE 4 

AUTHORS Kawai,J., Shinagawa,A., Shibata,K., Yoshino,M., Itoh,M., Ishii,Y.,
Arakawa,T., Hara,A., Fukunishi,Y., Konno,H., Adachi,J., Fukuda,S.,
Aizawa,K., Izawa,M., Nishi,K., Kiyosawa,H., Kondo,S., Yamanaka,I.,
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R.,
Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T.,
Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H.,
Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G.,
Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M.,
Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H.,
Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N.,
Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C.,
Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D.,
Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P.,
Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P.,
Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H.,
Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H.,
Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L.,
Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S.
and Hayashizaki,Y.

CONSRTM RIKEN Genome Exploration Research Group Phase II Team and the 
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SCORE Search Results Details for Application 
10729895 and Search Result 
20070112__180212_us-10-729-895a-2.rni. 


This page gives you Search Results detail for the Application 10729895 and Search Result 
20070112_180212_us-10-729-895a-2.rni. 




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM protein - nucleic search, using frame_plus_p2n model 

Run on: January 16, 2007, 05:14:59 ; Search time 245 Seconds 

(without alignments) 
3917.873 Million cell updates/sec 

Title: US-10-729-895A-2 

Perfect score: 1909 

Sequence: 1 MPFLLGLRQDKEACVGTNNQ LLNTINEQDSPNSQSSSSPS 342 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 



Searched: 1403666 seqs, 935554401 residues 

Total number of hits satisfying chosen parameters: 2807332 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n.model -DEV=xlh 

-Q=/abss/ABSSWEB_spool/US10729895/runat_12012007_180212_7416/app_query.fasta_l 
-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0.1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-HOST=abss03h -USER=US10729895_@CGN_l_l_204_@runat_12012007_180212_7416 
-NCPU=6 -ICPU=3 -NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 
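
The parameter list above is a flat run of `-NAME=VALUE` settings plus a few bare switches such as `-WAIT`. As a small illustrative sketch (not part of GenCore, and with a hypothetical function name), the following splits such a run into a dictionary so that the gap penalties can be compared against the header block. The sample string copies a few parameters from this report with the scanning spaces removed.

```python
def parse_params(text):
    """Split '-NAME=VALUE' tokens into a dict; bare '-NAME' switches map to True."""
    params = {}
    for token in text.split():
        if not token.startswith("-"):
            continue  # skip anything that is not a parameter token
        name, sep, value = token[1:].partition("=")
        params[name] = value if sep else True
    return params


sample = (
    "-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=rni -START=1 -END=-1 "
    "-LIST=45 -MODE=LOCAL -NCPU=6 -WAIT -LONGLOG "
    "-XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7"
)
params = parse_params(sample)
print(params["DB"], params["XGAPOP"], params["XGAPEXT"], params["WAIT"])
```

Only the first `=` in a token is treated as the separator, so a value such as `-1` in `-END=-1` is kept intact.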



Database : Issued_Patents_NA:* 

1: /EMC_Celerra_SIDS3/ptodata/2/ina/l_COMB.seq:* 

2: /EMC_Celerra_SIDS3/ptodata/2/ina/5_COMB.seq:* 

3: /EMC_Celerra_SIDS3/ptodata/2/ina/6A_COMB.seq:* 

4: /EMC_Celerra_SIDS3/ptodata/2/ina/6B_COMB.seq:* 

5: /EMC_Celerra_SIDS3/ptodata/2/ina/7_COMB.seq:* 

6: /EMC_Celerra_SIDS3/ptodata/2/ina/H_COMB.seq:* 

7: /EMC_Celerra_SIDS3/ptodata/2/ina/PCTUS_COMB.seq:* 

8: /EMC_Celerra_SIDS3/ptodata/2/ina/PP_COMB.seq:* 

9: /EMC_Celerra_SIDS3/ptodata/2/ina/RE_COMB.seq:* 

10: /EMC_Celerra_SIDS3/ptodata/2/ina/backfiles1.seq:* 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                             %
  Result                   Query
     No.    Score   Match   Length  DB  ID                     Description
  --------------------------------------------------------------------------
      1     1901    99.6     2703   4   US-10-094-749-823      Sequence 823, App
      2    469.5    24.6     2042   4   US-10-094-749-344      Sequence 344, App
      3      462    24.2      399   3   [illegible]            Sequence 274, App
      4      462    24.2      399   3   US-09-471-276-82       Sequence 82, Appl
      5      452    23.7     1292   3   [illegible]            Sequence 182, App
      6      452    23.7     1292   3   [illegible]            Sequence 182, App
      7    329.5    17.3      861   3   [illegible]            Sequence 83, Appl
      8    329.5    17.3      861   3   [illegible]            Sequence 83, Appl
      9    253.5    13.3      489   3   [illegible]            Sequence 3746, Ap
     10    150.5     7.9     2196   ?   [illegible]            Sequence 5786, Ap
     11    145.5     7.6    17098   3   [illegible]            Sequence 13119, A
     12    142.5     7.5      899   4   [illegible]            Sequence 2686, Ap
     13        ?     7.4        ?   ?   [illegible]            Sequence 76, Appl
     14      141     7.4        ?   ?   [illegible]            Sequence 74, Appl
  c  15      141     7.4        ?   ?   [illegible]            Sequence 17225, A
     16        ?     7.3        ?   ?   [illegible]            Sequence 2, Appli
     17    138.5     7.3        ?   ?   [illegible]            Sequence 1, Appli
     18        ?     7.2        ?   ?   [illegible]            Sequence 5292, Ap
     19    136.5     7.2     3671   ?   [illegible]            Sequence 232, App
     20        ?     7.2        ?   ?   [illegible]            Sequence 3, Appli
     21        ?     7.2        ?   ?   [illegible]            Sequence 117, App
     22    136.5     7.2        ?   ?   [illegible]            Sequence 21, Appl
     23        ?     7.2        ?   ?   [illegible]            Sequence 15, Appl
     24        ?     7.2        ?   ?   [illegible]            Sequence 1, Appli
     25    136.5     7.2     6911   3   [illegible]            Sequence 14, Appl
     26    136.5     7.2     6911   3   [illegible]            Sequence 16, Appl
     27        ?     7.2        ?   ?   [illegible]            Sequence 17, Appl
     28    136.5     7.2     6911   3   [illegible]            Sequence 18, Appl
     29    136.5     7.2     6911   3   [illegible]            Sequence 19, Appl
     30    136.5     7.2     6911   ?   [illegible]            Sequence 20, Appl
     31    136.5     7.2     6912   3   [illegible]            Sequence 13, Appl
     32      136     7.1     8280   5   [illegible]            Sequence 65, Appl
  c  33      136     7.1    10080   5   US-09-936-271C-97      Sequence 97, Appl
  c  34      136     7.1    14077   3   US-09-902-540-1109     Sequence 1109, Ap
     35    135.5     7.1    25431   3   US-09-949-016-13234    Sequence 13234, A
  c  36    135.5     7.1   134008   3   US-09-949-016-13841    Sequence 13841, A
     37      135     7.1     5877   3   US-10-152-886-54       Sequence 54, Appl
  c  38    134.5     7.0      999   3   US-09-902-540-2460     Sequence 2460, Ap
  c  39      134     7.0    30656   3   US-09-949-016-14613    Sequence 14613, A
  c  40      132     6.9     2214   3   US-08-864-038A-1       Sequence 1, Appli
  c  41      132     6.9     3331   3   US-08-864-038A-2       Sequence 2, Appli
  c  42      132     6.9     3331   3   US-08-864-038A-4       Sequence 4, Appli
     43      131     6.9     2889   2   US-08-537-002A-4       Sequence 4, Appli
     44      131     6.9     2889   3   US-08-863-010-4        Sequence 4, Appli
     45      131     6.9     2889   3   US-09-024-429-4        Sequence 4, Appli

Entries shown as ? or [illegible] could not be read in the scanned source.
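
The "% Query Match" column in the summaries is numerically consistent with the hit score expressed as a percentage of the perfect score (1909 for this query). The sketch below rests on that inference from the listed values rather than on GenCore documentation, and the function name `query_match` is hypothetical.

```python
PERFECT_SCORE = 1909.0  # "Perfect score" reported in the header of this search


def query_match(score, perfect=PERFECT_SCORE):
    """Score as a percentage of the perfect score, rounded to one decimal."""
    return round(100.0 * score / perfect, 1)


# Scores taken from the summaries: results 1, 2, 4 and 45
for score in (1901, 469.5, 462, 131):
    print(score, query_match(score))
```

For these four scores the computed values agree with the listed 99.6, 24.6, 24.2 and 6.9.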



ALIGNMENTS 



RESULT 1 

US-10-094-749-823 

Sequence 823, Application US/10094749 
Patent No. 6979557 
GENERAL INFORMATION: 



APPLICANT: ISOGAI, TAKAO 
APPLICANT: SUGIYAMA, TOMOYASU 
APPLICANT: OTSUKI, TETSUJI 
APPLICANT: WAKAMATSU, AI 
APPLICANT: SATO, HIROYUKI 
APPLICANT: ISHII, SHIZUKO 
APPLICANT: YAMAMOTO, JUN-ICHI 
APPLICANT: ISONO, YUUKO 
APPLICANT: HIO, YURI 
APPLICANT: OTSUKA, KAORU 
APPLICANT: NAGAI, KEIICHI 
APPLICANT: IRIE, RYOTARO 
APPLICANT: TAMECHIKA, ICHIRO 
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APPLICANT: SEKI, NAOHIKO 
APPLICANT: YOSHIKAWA, TSUTOMU 
APPLICANT: OTSUKA, MOTOYUKI 
APPLICANT: NAGAHARI, KENJI 
APPLICANT: MASUHO, YASUHIKO 
TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA 
FILE REFERENCE: 084335/0160 
CURRENT APPLICATION NUMBER: US/10/094,749 
CURRENT FILING DATE: 2002-03-12 
PRIOR APPLICATION NUMBER: 60/350,435 
PRIOR FILING DATE: 2002-01-24 
PRIOR APPLICATION NUMBER: JP 2001-328381 
PRIOR FILING DATE: 2001-09-14 
NUMBER OF SEQ ID NOS: 3381 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 823 
LENGTH: 2703 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-094-749-823 



Alignment Scores: 



Pred. No.:              4.81e-140    Length:        2703
Score:                  1901.00      Matches:       340
Percent Similarity:     99.7%        Conservative:  1
Best Local Similarity:  99.4%        Mismatches:    1
Query Match:            99.6%        Indels:        0
DB:                     4            Gaps:          0



US-10-729-895A-2 (1-342) x US-10-094-749-823 (1-2703) 



Qy 


1 


Db 


145 


Qy 


21 


Db 


205 


Qy 


41 


Db 


265 


Qy 


61 


Db 


325 


Qy 


81 


Db 


385 


Qy 


101 


Db 


445 


Qy 


121 


Db 


505 


Qy 


141 


Db 


565 


Qy 


161 


Db 


625 


Qy 


181 


Db 


685 


Qy 


201 


Db 


745 


Qy 


221 



MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I 
ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204 



40 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



264 



GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 
I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I II I I I I I I I I ! I i I I I I I I I I I I I I I I I 

GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 

CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 
I I I I I I I I I I I M M I I I I I I I i I I I I I 1 i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 

LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I 



ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



60 



324 



80 



384 



100 



120 



504 



ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 
I I I I i I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 



140 



564 



160 



624 



LeuSerGluProSerArgSerSerThrArgProProSerlleAlaAspProAspProSer 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 n i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 



180 



TT GT CT GAGCCCAGCAGAAGCAGC ACAAGACCCCCAAGCATCGCT GACCCTGAT CCCT CT 684 

AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I 

GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 74 4 

AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220 
I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCT GGCCT GGGGGAGCTGGACCCGGGGGCCTT CCTGGACAAAGATGCAGAAT GT AGGGAG 804 

GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240 
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Db 


805 


Qy 


241 


Db 


865 


Qy 


261 


Db 


925 


Qy 


281 


Db 


985 


Qy 


301 


Db 


1045 


Qy 


321 




i i rm 

1 1*J D 


Qy 


341 


Db 


1165 


RESULT 2 




US-10-094 


-749 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 

GAGCTGCT GAAAGAT GAC AGCT CT GAAC ACGGCGCACC CGAC AGCAAAGAGAAGACGCCT 864 



I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGGAGACAT C GCCGCT T C AC AGGT GACT CGGGC ATT GAAGT GT GT GTGT GC AACCGGGGC 924 



I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I II 
CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 104 4 

GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320 
I ! ! I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I::: 
CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104 

CysLeuLeuLeuAsnThrlleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 34 0 
I I I I II I I I I I I I II I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164 



I I I I I I 



Sequence 344, Application US/10094749
Patent No. 6979557
GENERAL INFORMATION:
APPLICANT: ISOGAI, TAKAO
APPLICANT: SUGIYAMA, TOMOYASU
APPLICANT: OTSUKI, TETSUJI
APPLICANT: WAKAMATSU, AI
APPLICANT: SATO, HIROYUKI
APPLICANT: ISHII, SHIZUKO
APPLICANT: YAMAMOTO, JUN-ICHI
APPLICANT: ISONO, YUUKO
APPLICANT: HIO, YURI
APPLICANT: OTSUKA, KAORU
APPLICANT: NAGAI, KEIICHI
APPLICANT: IRIE, RYOTARO
APPLICANT: TAMECHIKA, ICHIRO
APPLICANT: SEKI, NAOHIKO
APPLICANT: YOSHIKAWA, TSUTOMU
APPLICANT: OTSUKA, MOTOYUKI
APPLICANT: NAGAHARI, KENJI
APPLICANT: MASUHO, YASUHIKO
TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA
FILE REFERENCE: 084335/0160
CURRENT APPLICATION NUMBER: US/10/094,749
CURRENT FILING DATE: 2002-03-12
PRIOR APPLICATION NUMBER: 60/350,435
PRIOR FILING DATE: 2002-01-24
PRIOR APPLICATION NUMBER: JP 2001-328381
PRIOR FILING DATE: 2001-09-14
NUMBER OF SEQ ID NOS: 3381
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 344
LENGTH: 2042
TYPE: DNA
ORGANISM: Homo sapiens
US-10-094-749-344



Alignment Scores:

Pred. No.: 1.86e-27 Length: 2042
Score: 469.50 Matches: 119
Percent Similarity: 39.8% Conservative: 26
Best Local Similarity: 32.7% Mismatches: 96
Query Match: 24.6% Indels: 123
DB: 4 Gaps: 12


US-10-729-895A-2 (1-342) x US-10-094-749-344 (1-2042)

Qy 11 LysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHisCysCys 30
:: : 1 i 1 III III 1 1 1 1 1 II 1 1 1 1 1 ::: 1 1 1 :::::: 1 1 1 1 1 1 1 1 1 1 1 1
Db 1102 CGAGAGCTGTGCCCAGGAGTGAACAACCAGCCCTACCTCTGTGAGAGTGGTCACTGCTGC 1161

Qy 31 GlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuValTrpThr 50
III:::::: 1 1 1 1 1 1 1 1 t 1 t 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 ::: 1 1 1 1 1 1
Db 1162 GGGGAGACTGGCTGCTGCACCTACTACTATGAGCTCTGGTGGTTCTGGCTGCTCTGGACT 1221

Qy 51 IleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHisArgLeu 70
:::::: 1 1 1: :: 1 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1
Db 1222 GTCCTCATCCTCTTTAGCTGCTGTTGCGCCTTCCGCCACCGACGAGCTAAACTCAGGCTG 1281

Qy 71 GlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAla HisAsn 89
III 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ::: 1 1 1 1 1 1 lit III
Db 1282 CAACAACAGCAGCGGCAGCGTGAAATCAACTTGTTGGCCTATCATGGGGCATGCCATGGG 1341

Qy 90 TyrSerAlaLeuPro PheTyrPheArgPheLeuProAsnTyrLeuLeu 105
III 1 1 1 1 1 II 1 1 :::
Db 1342 GCTGGTCCTTTCCCTACCGGTTCACTGCTTGACCTTCGCTTCCTCAGCACCTTCAAGCCC 1401

Qy 106 ProProTyrGluGluValValAsnArgProProThrProProProProTyrSerAlaPhe 125
III ' 1 1 1 1 1 1 :: : 1 1 1 1 1 1 : :: II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 II 1 1 :: :
Db 1402 CCAGCCTACGAGGATGTGGTTCACCGCCCAGGCACACCACCCCCCCCTTATACTGTG--- 1458

Qy 126 GlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProPro 145

Db 1458 1458

Qy 146 GlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSer 165
IN 1 1 1 1 .1 1 : : : III
Db 1459 GCCCCAGGCCGCCCCTTGACTGCTTCCAGT 1488

Qy 166 ArgSerSerThrArgProProSerIleAlaAspProAspProSerAspLeuProValAsp 185
::: III III III
Db 1489 GAACAAACCTGCTGTTCCTCCTCA TCCAGCTGCCCTGCCCAC 1530

Qy 186 ArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGlu 205
III III:::
Db 1531 TTTGAAGGAACAAATGTGGAAGGT GTT 1557

Qy 206 LeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAsp 225

Db 1557 1557

Qy 226 AspSerSerGluHisGlyAlaProAspSerLysGluLysThrProGly 241
1 1 1 1 II llllll : : : 1 II 1 1 1 II 1
Db 1558 TCCTCCCACCAGAGTGCCCCCCCTCATCAGGAGGGTGAGCCCGGGGCAGGGGTGACC 1614

Qy 242 ArgHisArgArgPheThrGlyAspSerGlyIleGlu 253
1 1 1 : :: 1 U 1 1 1 1 1 1 1 II 1 1 1 1 i 1 1 II 1 1 1
Db 1615 CCTGCCTCCACACCCCCCTCCTGCCGCTATCGCCATTTAACTGGCGACTCCGGTATTGAG 1674

Qy 254 ValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGluPheAsn 270
: : : II 1 III :: : : : : 1 1 II 1 1
Db 1675 CTCTGCCCTTGTCCTGCCTCCGGTGAGGGTGAGCCAGTCAAGGAGGTGAGGGTTAGTGCC 1734

Qy 271 ThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHisValArg 290
IIUII III III::: 111 :: :
Db 1735 ACCCTGCCAGAT CTGGAGGACTACTCCCCGTGTGCACTACCC 1776

Qy 291 Pro ProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAla 307
III III It II II III
Db 1777 CCAGAGTCTGTACCGCAGATCTTTCCCATGGGGCTGTCTTCCAGT 1821

Qy 308 ArgGluProGlyHisPro HisLeuProArgPro 318
III III : :: II II 1 1
Db 1822 GAAGGGGACATCCCATAAGTAGTTTTGAGAGGGTGGATGGGTTACTTGCCC 1872

Qy 319 ProAlaCysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSer 338
1 1 1 II II 1 1 1 1 1 III
Db 1873 ACCAGAAACAGCCCTAGTCCCAACTCCTTGCGTTCC 1908

Qy 339 SerSerProSer 342
II 1 1 1 1
Db 1909 TTTGGCCCCTCC 1920

RESULT 3

US-09-513-999C-274 

; Sequence 274, Application US/09513999C
; Patent No. 6783961
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert, A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6783961
; FILE REFERENCE: 59.US2.REG
; CURRENT APPLICATION NUMBER: US/09/513,999C
; CURRENT FILING DATE: 2000-02-24
; PRIOR APPLICATION NUMBER: US 60/122,487
; PRIOR FILING DATE: 1999-02-26
; NUMBER OF SEQ ID NOS: 36681
; SOFTWARE: Patent.pm
; SEQ ID NO 274
; LENGTH: 399
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 81..398
; FEATURE:
; NAME/KEY: sig_peptide
; LOCATION: 81..152
; OTHER INFORMATION: score 10
; OTHER INFORMATION: seq LLLLQALPSPLSA/RA
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 251
; OTHER INFORMATION: y=c or t
US-09-513-999C-274



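Alignment Scores:

Pred. No.: 7.43e-28 Length: 399
Score: 462.00 Matches: 77
Percent Similarity: 100.0% Conservative: 0
Best Local Similarity: 100.0% Mismatches: 0
Query Match: 24.2% Indels: 0
DB: 3 Gaps: 0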

US-10-729-895A-2 (1-342) x US-09-513-999C-274 (1-399) 



Qy   9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 168 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 227

Qy  29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 228 TGCTGTGGACAGTCTCAGTGCTGYAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 287

Qy  49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 288 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 347

Qy  69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArg 85
       |||||||||||||||||||||||||||||||||||||||||||||||||||
Db 348 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGA 398



RESULT 4 

US-09-471-276-82 

; Sequence 82, Application US/09471276
; Patent No. 6822072
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert, A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6822072
; FILE REFERENCE: GENSET.025CP1
; CURRENT APPLICATION NUMBER: US/09/471,276
; CURRENT FILING DATE: 1999-12-21
; EARLIER APPLICATION NUMBER: 09/057,719
; EARLIER FILING DATE: 1998-04-09
; EARLIER APPLICATION NUMBER: 09/069,047
; EARLIER FILING DATE: 1998-04-28
; EARLIER APPLICATION NUMBER: PCT/IB99/00712
; EARLIER FILING DATE: 1999-04-09
; NUMBER OF SEQ ID NOS: 1622
; SOFTWARE: Patent.pm
; SEQ ID NO 82
; LENGTH: 399
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 81..398
; NAME/KEY: sig_peptide
; LOCATION: 81..152
; OTHER INFORMATION: Von Heijne matrix
; OTHER INFORMATION: score 10
; OTHER INFORMATION: seq LLLLQALPSPLSA/RA
US-09-471-276-82



Alignment Scores:

Pred. No.: 7.43e-28 Length: 399
Score: 462.00 Matches: 77
Percent Similarity: 100.0% Conservative: 0
Best Local Similarity: 100.0% Mismatches: 0
Query Match: 24.2% Indels: 0
DB: 3 Gaps: 0




US-10-729-895A-2 (1-342) x US-09-471-276-82 (1-399) 



Qy   9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 168 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 227

Qy  29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 228 TGCTGTGGACAGTCTCAGTGCTGYAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 287

Qy  49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 288 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 347

Qy  69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArg 85
       |||||||||||||||||||||||||||||||||||||||||||||||||||
Db 348 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGA 398

RESULT 5 

US-09-247-155-182 

; Sequence 182, Application US/09247155A
; Patent No. 6312922
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, Jean-Baptiste
; APPLICANT: Duclert, Aymeric
; APPLICANT: Bougueleret, Lydie
; TITLE OF INVENTION: Complementary DNAs
; FILE REFERENCE: GENSET.021A
; CURRENT APPLICATION NUMBER: US/09/247,155A
; CURRENT FILING DATE: 1999-02-09
; EARLIER APPLICATION NUMBER: 60/074,121
; EARLIER FILING DATE: 1998-02-09
; EARLIER APPLICATION NUMBER: 60/081,563
; EARLIER FILING DATE: 1998-04-13
; EARLIER APPLICATION NUMBER: 60/096,116
; EARLIER FILING DATE: 1998-08-10
; EARLIER APPLICATION NUMBER: 60/099,273
; EARLIER FILING DATE: 1998-10-04
; NUMBER OF SEQ ID NOS: 182
; SOFTWARE: Patent.pm
; SEQ ID NO 182
; LENGTH: 1292
; TYPE: DNA
; ORGANISM: Mus musculus
US-09-247-155-182

Alignment Scores:

Pred. No.: 2.34e-26 Length: 1292
Score: 452.00 Matches: 101
Percent Similarity: 47.1% Conservative: 23
Best Local Similarity: 38.4% Mismatches: 77
Query Match: 23.7% Indels: 62
DB: 3 Gaps: 7



US-10-729-895A-2 (1-342) x US-09-247-155-182 (1-1292)

Qy 11 LysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHisCysCys 30
: : : 1 1 1 IN Ml III III 1 1 1 : : : 1 1 1 : : : 1 1 1 1 1 1 1
Db 330 CGAGAGCTGTGCCCAGGAGTGAATACCCAGCCCTACCTCTGTGAGACTGGTCATTGCTGT 389

Qy 31 GlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuValTrpThr 50
III:::::: 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : :: 1 1 1 1 1 1
Db 390 GGGGAGACTGGCTGCTGCACCTACTACTATGAACTCTGGTGGTTCTGGCTGCTTTGGACT 449

Qy 51 IleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHisArgLeu 70
:::::: 1 1 1 :: : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II
Db 450 GTCCTCATCCTCTTTAGCTGCTGTTGTGCCTTCCGCCACCGAAGGGCTAAACTCAGGCTG 509

Qy 71 GlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAla HisAsn 89
III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ::: 1 II 1 1 1 111 1 II
Db 510 CAACAGCAACAGCGGCAGCGTGAAATCAACTTGTTGGCTTACCATGGGGCATGCCACGGG 569

Qy 90 TyrSerAlaLeuPro PheTyrPheArgPheLeuProAsnTyrLeuLeu 105
: :: 1 1 1 III III : ::
Db 570 GCTGGCCCTGTTCCAACCGGTTCACTGCTTGACCTTCGCCTCCTCAGCGCCTTCAAACCC 629

Qy 106 ProProTyrGluGluValValAsnArgProProThrProProProProTyrSerAlaPhe 125
ill 1 1 1 1 II : : : 1 1 1 1 1 1 : : : III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : :
Db 630 CCAGCCTACGAGGATGTGGTTCACCACCCAGGCACACCGCCACCTCCTTACACTGTG 686

Qy 126 GlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProPro 145
1 1 1
Db 687 GGCCCA 692

Qy 146 GlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSer 165
III III III::: : :: III 1 1 1 1 1 I III
Db 693 GGCTACCCTTGGACTACTTCCAGTGAATGCACCCGCTGCTCT TCCGAATCCAGC 746

Qy 166 ArgSerSerThrArgProProSerIleAlaAspProAspProSerAspLeuProValAsp 185
III:::
Db 747 TGCTCTGCC CAC 758

Qy 186 ArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGlu 205
III III::: III ^ III
Db 759 TTGGAGGGGACAAATGTAGAAGGTGTTTCCTCCCAGCAGAGTGCTCTCCCTCACCAGGAG 818

Qy 206 LeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAsp 225
: :: III III III
Db 819 GGTGAGCCCAGGGCAGGATTG 839

Qy 226 AspSerSerGluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArg 245
III III III 1 1 1 ::: 1 1 1 1 1 1
Db 840 AGCCCAGTTCACATACCCCCTTCCTGC CGCTATCGTCGC 878

Qy 246 PheThrGlyAspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAsp 265
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ::: 1 1 1 III : : :
Db 879 CTAACTGGTGACTCGGGTATTGAGCTCTGCCCTTGTCCTGACTCCAGTGAAGGTGAGCCA 938

Qy 266 LeuLysGlu 268
1 1 1 1 1 1 1 1 1
Db 939 CTCAAGGAA 947

RESULT 6

US-09-903-190-182





Sequence 182, Application US/09903190
Patent No. 6936692
GENERAL INFORMATION:
APPLICANT: Dumas Milne Edwards, Jean-Baptiste
APPLICANT: Duclert, Aymeric
APPLICANT: Bougueleret, Lydie
TITLE OF INVENTION: Complementary DNAs
FILE REFERENCE: GENSET.021A
CURRENT APPLICATION NUMBER: US/09/903,190
CURRENT FILING DATE: 2001-07-11
PRIOR APPLICATION NUMBER: US/09/247,155A
PRIOR FILING DATE: 1999-02-09
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/074,121
PRIOR FILING DATE: EARLIER FILING DATE: 1998-02-09
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/081,563
PRIOR FILING DATE: EARLIER FILING DATE: 1998-04-13
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/096,116
PRIOR FILING DATE: EARLIER FILING DATE: 1998-08-10
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/099,273
PRIOR FILING DATE: EARLIER FILING DATE: 1998-10-04
NUMBER OF SEQ ID NOS: 182
SOFTWARE: Patent.pm
SEQ ID NO 182
LENGTH: 1292
TYPE: DNA
ORGANISM: Mus musculus
US-09-903-190-182



Alignment Scores: 

Pred. No.: 2.34e-26 Length: 1292 

Score: 452.00 Matches: 101 

Percent Similarity: 47.1% Conservative: 23 

Best Local Similarity: 38.4% Mismatches: 77 

Query Match: 23.7% Indels: 62 

DB: 3 Gaps: 7 
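
The percentage fields in these score blocks can be cross-checked against the raw counts. The sketch below is a minimal check, assuming relations that are inferred from the numbers printed in this report rather than taken from GenCore documentation: the aligned column count is the sum of matches, conservative substitutions, mismatches and indels; Best Local Similarity is matches over columns; Percent Similarity is matches plus conservative over columns; Query Match is the score over the perfect score (1909 for this query). The function name and argument names are hypothetical.

```python
def alignment_percentages(matches, conservative, mismatches, indels,
                          score, perfect_score):
    """Recompute the percentage fields of a score block from its counts.

    Assumed relations (inferred from this report, not from vendor docs):
      columns               = matches + conservative + mismatches + indels
      Best Local Similarity = matches / columns
      Percent Similarity    = (matches + conservative) / columns
      Query Match           = score / perfect_score
    """
    columns = matches + conservative + mismatches + indels
    return {
        "best_local_similarity": round(100.0 * matches / columns, 1),
        "percent_similarity": round(100.0 * (matches + conservative) / columns, 1),
        "query_match": round(100.0 * score / perfect_score, 1),
    }


# Counts from the score block directly above (US-09-903-190-182).
result_6 = alignment_percentages(101, 23, 77, 62, 452.00, 1909)
print(result_6)
```

Under these assumptions the recomputed values agree with the printed 38.4%, 47.1% and 23.7%; the same check applied to the other score blocks in this listing is a quick way to spot digits damaged in extraction.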

US-10-729-895A-2 (1-342) x US-09-903-190-182 (1-1292) 

Qy 11 LysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHisCysCys 30

:: : I I I Ml III III lit I I I : : : I I I : : : I I I I I I I I I I I I II I 
Db 330 CGAGAGCTGTGCCCAGGAGTGAATACCCAGCCCTACCTCTGTGAGACTGGTCATTGCTGT 389 



Qy 31 GlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuValTrpThr 50 

111:::::: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I : : : I i I I I I 
Db 390 GGGGAGACTGGCTGCTGCACCTACTACTATGAACTCTGGTGGTTCTGGCTGCTTTGGACT 449

Qy 51 IleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHisArgLeu 70

:::::: I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I

Db 450 GTCCTCATCCTCTTTAGCTGCTGTTGTGCCTTCCGCCACCGAAGGGCTAAACTCAGGCTG 509

Qy 71 GlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAla HisAsn 89

III I I I I I I I I I II I I I I I I I I I I I I I ::: I I I I I I III III

Db 510 CAACAGCAACAGCGGCAGCGTGAAATCAACTTGTTGGCTTACCATGGGGCATGCCACGGG 569

Qy 90 TyrSerAlaLeuPro PheTyrPheArgPheLeuProAsnTyrLeuLeu 105 

: : : I I I III III : : : 

Db 570 GCTGGCCCTGTTCCAACCGGTTCACTGCTTGACCTTCGCCTCCTCAGCGCCTTCAAACCC 629

Qy 106 ProProTyrGluGluValValAsnArgProProThrProProProProTyrSerAlaPhe 125 

III | | | | ! | :: : | | | | | | : : : III I I I I I I I I I I I I I II I I I :: : 
Db 630 CCAGCCTACGAGGATGTGGTTCACCACCCAGGCACACCGCCACCTCCTTACACTGTG--- 686

Qy 126 GlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProPro 145

I I I 

Db 687 GGCCCA 692 

Qy 146 GlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSer 165

III III III::: : : : III I I II I I III

Db 693 GGCTACCCTTGGACTACTTCCAGTGAATGCACCCGCTGCTCT TCCGAATCCAGC 746

Qy 166 ArgSerSerThrArgProProSerIleAlaAspProAspProSerAspLeuProValAsp 185

III:::

Db 747 TGCTCTGCC CAC 758

Qy 186 ArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGlu 205 

III ill::: I I I III 

Db 759 TTGGAGGGGACAAATGTAGAAGGTGTTTCCTCCCAGCAGAGTGCTCTCCCTCACCAGGAG 818 

Qy 206 LeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAsp 225 

: :: I I I I I I III 

Db 819 GGTGAGCCCAGGGCAGGATTG 839

Qy 226 AspSerSerGluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArg 245

III ill III I I I ::: I I I.I I I

Db 840 AGCCCAGTTCACATACCCCCTTCCTGC CGCTATCGTCGC 878

Qy 246 PheThrGlyAspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAsp 265

I I i I I i I I I I I I I I I I I I I I I ::: I I I I I I :: :

Db 879 CTAACTGGTGACTCGGGTATTGAGCTCTGCCCTTGTCCTGACTCCAGTGAAGGTGAGCCA 938

Qy 266 LeuLysGlu 268

I I I I I I I I I
Db 939 CTCAAGGAA 947

RESULT 7 

US-09-247-155-83

Sequence 83, Application US/09247155A
Patent No. 6312922
GENERAL INFORMATION:
APPLICANT: Dumas Milne Edwards, Jean-Baptiste
APPLICANT: Duclert, Aymeric
APPLICANT: Bougueleret, Lydie
TITLE OF INVENTION: Complementary DNAs
FILE REFERENCE: GENSET.021A
CURRENT APPLICATION NUMBER: US/09/247,155A
CURRENT FILING DATE: 1999-02-09
EARLIER APPLICATION NUMBER: 60/074,121
EARLIER FILING DATE: 1998-02-09
EARLIER APPLICATION NUMBER: 60/081,563
EARLIER FILING DATE: 1998-04-13
EARLIER APPLICATION NUMBER: 60/096,116
EARLIER FILING DATE: 1998-08-10
EARLIER APPLICATION NUMBER: 60/099,273
EARLIER FILING DATE: 1998-10-04
NUMBER OF SEQ ID NOS: 182
SOFTWARE: Patent.pm
SEQ ID NO 83
LENGTH: 861
TYPE: DNA
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: 612..644
FEATURE:
NAME/KEY: polyA_signal
LOCATION: 829..834
FEATURE:
NAME/KEY: polyA_site
LOCATION: 850..861
US-09-247-155-83

Alignment Scores: 

Pred. No.: 5.93e-17 Length: 861

Score: 329.50 Matches: 100 

Percent Similarity: 36.0% Conservative: 20 

Best Local Similarity: 30.0% Mismatches: 90 

Query Match: 17.3% Indels: 123 

DB: 3 Gaps: 12 

US-10-729-895A-2 (1-342) x US-09-247-155-83 (1-861)

Qy 42 LeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysValCys 61
I I I I I I I I I I I I I I 1 I I I ::: I I I I II :::::: I I I : I I I I I I I I I I I I
Db 3 CTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTAGCTGCTGTTGCGCCTTC 62

Qy 62 HisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeu 81
I I I I I I I I I I I I M I 1 I I I I I i I I I I I I I I I t I I I I I I I I I I I I I I I I
Db 63 CGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGCAGCGTGAAATCAACTTG 122

Qy 82 IleAlaTyrArgGluAla HisAsnTyrSerAlaLeuPro PheTyr 96
:: : I I I I I I • III III III
Db 123 TTGGCCTATCATGGGGCATGCCATGGGGCTGGTCCTTTCCCTACCGGTTCACTGCTTGAC 182

Qy 97 PheArgPheLeuProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProPro 116
IK III :: : I I I I I I I I I : : : I I I I I I : : : I I I I I I
Db 183 CTTCGCCTCCTCAGCACCTTCAAGCCCCCAGCCTACGAGGATGTGGTTCACCGCCCAGGC 242

Qy 117 ThrProProProProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGln 136
I I I I I I I I I I I I I I I I I I :: :
Db 243 ACACCACCCCCCCCTTATACTGTG 266

Qy 137 CysGlyProAlaGlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAla 156
I i I
Db 267 GCC 269



Qy 157 GlnSerSerProLeuSerGluProSerArgSerSerThrArgProProSerIleAlaAsp 176

MINI::: III ::: III 
Db 270 CCAGGCCGCCCCTTGACTGCTTCCAGTGAACAAACCTGCTGTTCCTCCTCA 320 

Qy 177 ProAspProSerAspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluPro 196 

III III Hi III::: 
Db 321 TCCAGCTGCCCTGCCCACTTTGAAGGAACAAATGTGGAAGGTGTT 365 

Qy 197 SerGlySerValAlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAla 216 

Db 365 365 

Qy 217 GluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLys 236 

I I I I I I I I I I I I : : : 

Db 366 TCCTCCCACCAGAGTGCCCCCCCTCATCAG 395 

Qy 237 GluLysThrProGly ArgHisArg 244

III I I I I I I I I I : : : I I I

Db 396 GAGGGTGAGCCCGGGGCAGGGGTGACCCCTGCCTCCACACCCCCCTCCTGCCGCTATCGC 455

Qy 245 ArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAsp 264

III I I I I I I I I I I I I I I I I ::: I I I Ml ■ ■

Db 456 CGTTTAACTGGCGACTCCGGTATTGAGCTCTGCCCTTGTCCTGCCTCCGGTGAGGGTGAG 515

Qy 265 AspLeuLysGluPheAsn ThrLeuIleAspAspAlaLeuAspGlyProLeu 281 

: :: I I I I I I I I I I I I I I I III 
Db 516 CCAGTCAAGGAGGTGAGGGTTAGTGCCACCCTGCCAGAT CTG 557 

Qy 282 AspPheCysAspSerCysHisValArgPro ProGlyAspGluGluGluGly 298 

: : : III : : : III III' III 

Db 558 GAGGACTACTCCCCGTGTGCACTACCCCCAGAGTCTGTACCGCAGATCTTTCCCATGGGG 617 

Qy 299 LeuCysGlnSerSerGluGluGlnAlaArgGluProGlyHisPro 313 

III III III Ml 

Db 618 CTGTCTTCCAGT GAAGGGGACATCCCATAAGTAGTTTTGAGA 659 

Qy 314 HisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGlu 329

: : : I I M I f

Db 660 GGGTGGATGGGTTACTTGCCC ACCAGAAAC 689

Qy 330 GlnAspSerProAsnSerGlnSerSerSerSerProSer 342

M I I I I M I I I I III I I I II I 

Db 690 AGCCCTAGTCCCAACTCCTTGCGTTCCTTTGGCCCCTCC 728 

RESULT 8 

US-09-903-190-83 

Sequence 83, Application US/09903190
Patent No. 6936692
GENERAL INFORMATION:
APPLICANT: Dumas Milne Edwards, Jean-Baptiste
APPLICANT: Duclert, Aymeric
APPLICANT: Bougueleret, Lydie
TITLE OF INVENTION: Complementary DNAs
FILE REFERENCE: GENSET.021A
CURRENT APPLICATION NUMBER: US/09/903,190
CURRENT FILING DATE: 2001-07-11
PRIOR APPLICATION NUMBER: US/09/247,155A
PRIOR FILING DATE: 1999-02-09
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/074,121
PRIOR FILING DATE: EARLIER FILING DATE: 1998-02-09
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/081,563
PRIOR FILING DATE: EARLIER FILING DATE: 1998-04-13
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/096,116
PRIOR FILING DATE: EARLIER FILING DATE: 1998-08-10
PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/099,273
PRIOR FILING DATE: EARLIER FILING DATE: 1998-10-04
NUMBER OF SEQ ID NOS: 182
SOFTWARE: Patent.pm
SEQ ID NO 83
LENGTH: 861
TYPE: DNA
ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: 612..644
FEATURE:
NAME/KEY: polyA_signal
LOCATION: 829..834
FEATURE:
NAME/KEY: polyA_site
LOCATION: 850..861
US-09-903-190-83

Alignment Scores:

Pred. No.: 5.93e-17 Length: 861
Score: 329.50 Matches: 100
Percent Similarity: 36.0% Conservative: 20
Best Local Similarity: 30.0% Mismatches: 90
Query Match: 17.3% Indels: 123
DB: 3 Gaps: 12



US-10-729-895A-2 (1-342) x US-09-903-190-83 (1-861)

Qy 42 LeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysValCys 61
I I I 1. 1 I I I I I I I I I I I I I ::: I I I I I I :::::: I I I :: : I I I I I I I I I I I I
Db 3 CTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTAGCTGCTGTTGCGCCTTC 62

Qy 62 HisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeu 81
! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I
Db 63 CGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGCAGCGTGAAATCAACTTG 122

Qy 82 IleAlaTyrArgGluAla HisAsnTyrSerAlaLeuPro PheTyr 96
: : : I I I I I I III III I I I
Db 123 [illegible] 182

Qy 97 PheArgPheLeuProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProPro 116
III ill :: : 111 I I I I I I : : : I I I I I I : : : I I I II I
Db 183 [illegible] 242

Qy 117 ThrProProProProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGln 136
! I I I I I I I I I I I I I I I I I ::
Db 243 [illegible] 266

Qy 137 CysGlyProAlaGlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAla 156
I I I
Db 267 GCC 269

Qy 157 GlnSerSerProLeuSerGluProSerArgSerSerThrArgProProSerIleAlaAsp 176
III::: III : :: III
Db 270 [illegible] 320

Qy 177 ProAspProSerAspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluPro 196
II I ill III III:::
Db 321 TCCAGCTGCCCTGCCCACTTTGAAGGAACAAATGTGGAAGGTGTT 365

Qy 197 SerGlySerValAlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAla 216

Db 365 365

Qy 217 GluCysArgGluGluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLys 236
I I 1 I I I I I I I I I :::
Db 366 TCCTCCCACCAGAGTGCCCCCCCTCATCAG 395

Qy 237 GluLysThrProGly ArgHisArg 244
Mi I I MM lll:::HI
Db 396 [illegible] 455

Qy 245 ArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAsp 264
II I I I I I I I I I I I I I I I I I I I I I ::: I I I III
Db 456 [illegible] 515

Qy 265 AspLeuLysGluPheAsn ThrLeuIleAspAspAlaLeuAspGlyProLeu 281
Mill I I II I I III III
Db 516 [illegible]AGGAGGTGAGGGTTAGTGCCACCCTGCCAGAT CTG 557

Qy 282 AspPheCysAspSerCysHisValArgPro ProGlyAspGluGluGluGly 298
II I : : : III Ml ill
Db 558 [illegible] 617

Qy 299 LeuCysGlnSerSerGluGluGlnAlaArgGluProGlyHisPro 313
Mi Ml I I I Ml
Db 618 CTGTCTTCCAGT GAAGGGGACATCCCATAAGTAGTTTTGAGA 659

Qy 314 HisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsnGlu 329
: : : I II II I
Db 660 GGGTGGATGGGTTACTTGCCC ACCAGAAAC 689

Qy 330 GlnAspSerProAsnSerGlnSerSerSerSerProSer 342
I I I I I I II M I I • III M I II I
Db 690 AGCCCTAGTCCCAACTCCTTGCGTTCCTTTGGCCCCTCC 728



RESULT 9 

US-09-513-999C-3746 

; Sequence 3746, Application US/09513999C
; Patent No. 6783961
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert, A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6783961
; FILE REFERENCE: 59.US2.REG
; CURRENT APPLICATION NUMBER: US/09/513,999C
; CURRENT FILING DATE: 2000-02-24
; PRIOR APPLICATION NUMBER: US 60/122,487
; PRIOR FILING DATE: 1999-02-26
; NUMBER OF SEQ ID NOS: 36681
; SOFTWARE: Patent.pm
; SEQ ID NO 3746
; LENGTH: 489
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: 126..455
; FEATURE:
; NAME/KEY: sig_peptide
; LOCATION: 126..233
; OTHER INFORMATION: score 9.7
; OTHER INFORMATION: seq LWTVLILFSCCCA/FX
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 237
; OTHER INFORMATION: n=a, g, c or t
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 281
; OTHER INFORMATION: s=g or c
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 361
; OTHER INFORMATION: s=g or c
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 378
; OTHER INFORMATION: n=a, g, c or t
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 406
; OTHER INFORMATION: m=a or c
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 410
; OTHER INFORMATION: n=a, g, c or t
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 411
; OTHER INFORMATION: n=a, g, c or t
; FEATURE:
; NAME/KEY: misc_feature
; LOCATION: 422
; OTHER INFORMATION: m=a or c
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 2
; OTHER INFORMATION: Xaa=Cys or Gly or Arg or Ser
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 16
; OTHER INFORMATION: Xaa=His or Gln
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 43
; OTHER INFORMATION: Xaa=Ala or Gly
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 49
; OTHER INFORMATION: Xaa=Cys or Gly or Arg or Ser
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 58
; OTHER INFORMATION: Xaa=Asn or Thr
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 60
; OTHER INFORMATION: Xaa=* or Glu or Lys or Gln
; FEATURE:
; NAME/KEY: UNSURE
; LOCATION: 63
; OTHER INFORMATION: Xaa=His or Gln
US-09-513-999C-3746


Alignment Scores:

Pred. No.: 2.61e-11 Length: 489
Score: 253.50 Matches: 52
Percent Similarity: 62.0% Conservative: 10
Best Local Similarity: 52.0% Mismatches: 33



SCORE 1.3 BuildDate: 11/17/2006

1/22/2007

SCORE Search Results Details for Application 10729895 and Search Result 20070112_180215_us-10-729-895a-2.rnpbm.

GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on: January 16, 2007, 05:31:04 ; Search time 1483 Seconds
(without alignments)
4250.544 Million cell updates/sec

Title: US-10-729-895A-2
Perfect score: 1909
Sequence: 1 MPFLLGLRQDKEACVGTNNQ...LLNTINEQDSPNSQSSSSPS 342

Scoring table: BLOSUM62
Xgapop 10.0 , Xgapext 0.5
Ygapop 10.0 , Ygapext 0.5
Fgapop 6.0 , Fgapext 7.0
Delop 6.0 , Delext 7.0

Searched: 18892170 seqs, 6143817638 residues

Total number of hits satisfying chosen parameters: 37784340

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries

Command line parameters: 
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/abss/ABSSWEB_spool/US10729895/runat_12012007_180215_7466/app_query.fasta_l
-DB=Published_Applications_NA_Main -QFMT=fastap -SUFFIX=rnpbm -MINMATCH=0.1
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62
-TRANS=human40.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0
-MAXLEN=2000000000 -HOST=abss05p
-USER=US10729895_@CGN_l_l_2530_@runat_12012007_180215_7466 -NCPU=6 -ICPU=3
-NO_MMAP -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG -DEV_TIMEOUT=120
-WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7



Database : Published_Applications_NA_Main:*
1: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_PUBCOMB.seq:*
3: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
4: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
5: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
6: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
7: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
8: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
9: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
10: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
11: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10F_PUBCOMB.seq:*
12: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB.seq:*
13: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11A_PUBCOMB.seq:*
14: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11B_PUBCOMB.seq:*
15: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11C_PUBCOMB.seq:*
16: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11D_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                         %
 Result                Query
    No.     Score      Match       Length           DB  ID                       Description

      1      1909   100.0         1080           12  US-10-729-895-1          Sequence 1, Appli
      2      1909   100.0         2976            7  US-10-297-880-7          Sequence 7, Appli
      3      1909   100.0         4122           12  US-10-729-895-16         Sequence 16, Appl
      4      1901    99.6         2703            7  US-10-094-749-823        Sequence 823, App
      5      1868    97.9         1140           12  US-10-729-895-3          Sequence 3, Appli
      6      1863    97.6  [illegible]  [illegible]  [illegible]              Sequence 32, Appl
      7      1863    97.6         1508            8  [illegible]              Sequence 32, Appl
      8      1863    97.6         1550            7  [illegible]              Sequence 35, Appl
      9      1863    97.6         3095  [illegible]  [illegible]              Sequence [illegible]
     10      1805    94.6         1715            7  [illegible]              Sequence 408, App
     11      1805    94.6         1715            8  [illegible]              Sequence 408, App
     12      1294    67.8        69081            6  US-10-087-192-1192       Sequence 1192, Ap
c    13       918    48.1          505            3  [illegible]              Sequence 8443, Ap
c    14       753    39.4       684973            3  US-09-263-959-1          Sequence 1, Appli
     15       751    39.3          439            3  [illegible]              Sequence 14563, A
     16     709.5    37.2          566            4  US-09-925-065A-842189    Sequence 842189,
     17     709.5    37.2          566            5  US-09-925-065A-842189    Sequence 842189,
c    18       640    33.5          352            3  US-09-864-761-25111      Sequence [illegible], A
     19       631    33.1          382            3  [illegible]              Sequence 1832, Ap
     20       631    33.1          382            3  [illegible]              Sequence 8179, Ap
     21       579    30.3  [illegible]  [illegible]  [illegible]              Sequence 9876, Ap
     22     554.5    29.0          600           10  [illegible]              Sequence [illegible], A
     23     525.5    27.5  [illegible]           10  [illegible]              Sequence 79548, A
     24     476.5    25.0         1199            3  [illegible]              [illegible]
     25     474.5    24.9         2047            7  US-10-108-260A-107       Sequence 107, App
     26       474    24.8         1158            3  US-09-822-849A-15        Sequence 15, Appl
     27     469.5    24.6         2042            7  US-10-094-749-344        Sequence 344, App
     28       462    24.2          399           10  US-10-926-683-82         Sequence 82, Appl
     29       452    23.7         1292            3  US-09-903-190-182        Sequence 182, App
     30       452    23.7         1292           10  US-10-930-331-182        Sequence 182, App
c    31       450    23.6          528           12  US-10-301-480-503460     Sequence 503460,
c    32       450    23.6          528           12  US-10-301-480-1116869    Sequence 1116869,
     33       450    23.6          561           12  US-10-301-480-461407     Sequence 461407,
     34       450    23.6          561           12  US-10-301-480-1074816    Sequence 1074816,
c    35       450    23.6          562            4  US-09-925-065A-125966    Sequence 125966,
c    36       450    23.6          562            5  US-09-925-065A-125966    Sequence 125966,
c    37       450    23.6          568            4  US-09-925-065A-442874    Sequence 442874,
c    38       450    23.6          568            5  US-09-925-065A-442874    Sequence 442874,
c    39       450    23.6          571           12  US-10-301-480-223349     Sequence 223349,
c    40       450    23.6          571           12  US-10-301-480-836758     Sequence 836758,
c    41       449    23.5          558           12  US-10-301-480-503459     Sequence 503459,
c    42       449    23.5          558           12  US-10-301-480-1116868    Sequence 1116868,
c    43       449    23.5          569            4  US-09-925-065A-442873    Sequence 442873,
c    44       449    23.5          569            5  US-09-925-065A-442873    Sequence 442873,
     45       448    23.5          561           12  US-10-301-480-461406     Sequence 461406,
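
The report does not define the "% Query Match" column. The legible rows are consistent with it being the hit score expressed as a percentage of the perfect score (1909 for this query), rounded to one decimal. The sketch below only illustrates that assumed relationship; the function name is hypothetical and this is not GenCore's documented formula.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Assumed relationship: hit score as a percentage of the perfect score."""
    return round(100.0 * score / perfect_score, 1)


# Perfect score reported in the run header for query US-10-729-895A-2.
PERFECT_SCORE = 1909

# A few scores taken from the summary table above.
for score in (1909, 1901, 1868, 1294, 450):
    print(score, query_match_percent(score, PERFECT_SCORE))
```

Under this assumption, a garbled percentage in the table can be cross-checked against its score, which is usually easier to read.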



ALIGNMENTS 

RESULT 1 
US-10-729-895-1 

; Sequence 1, Application US/10729895 
; Publication No. US20060063156A1 
; GENERAL INFORMATION: 

; APPLICANT: UNIVERSITY OF NEW MEXICO 

; TITLE OF INVENTION: OUTCOME PREDICTION AND RISK CLASSIFICATION IN CHILDHOOD 

; TITLE OF INVENTION: LEUKEMIA 

; FILE REFERENCE: N12-038US/310.00050101

; CURRENT APPLICATION NUMBER: US/10/729,895

; CURRENT FILING DATE: 2003-12-05 
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; PRIOR APPLICATION NUMBER: 60/510,904 

; PRIOR FILING DATE: 2003-10-14 

; PRIOR APPLICATION NUMBER: 60/510,968 

; PRIOR FILING DATE: 2003-10-14 

; PRIOR APPLICATION NUMBER: 60/432,064 

; PRIOR FILING DATE: 2002-12-06 

; PRIOR APPLICATION NUMBER: 60/432,077 

; PRIOR FILING DATE: 2002-12-06 

; PRIOR APPLICATION NUMBER: 60/432,078 

; PRIOR FILING DATE: 2002-12-06 

; NUMBER OF SEQ ID NOS: 18

; SOFTWARE: PatentIn Ver. 3.2

; SEQ ID NO 1 

LENGTH: 1080 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/KEY: CDS
LOCATION: (1)..(1026)
US-10-729-895-1 



Alignment Scores:
Pred. No.:              1.11e-177    Length:        1080
Score:                  1909.00      Matches:       342
Percent Similarity:     100.0%       Conservative:  0
Best Local Similarity:  100.0%       Mismatches:    0
Query Match:            100.0%       Indels:        0
DB:                     12           Gaps:          0



US-10-729-895A-2 (1-342) x US-10-729-895-1 (1-1080)

Qy      1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60

Qy     21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120

Qy     41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180

Qy     61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240

Qy     81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300

Qy    101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360

Qy    121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420

Qy    141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480

Qy    161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540

Qy    181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600

Qy    201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660

Qy    221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720

Qy    241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780

Qy    261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840

Qy    281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900

Qy    301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960

Qy    321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020

Qy    341 ProSer 342
          ||||||
Db   1021 CCCAGC 1026
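
In these protein-to-nucleic alignments each group of three Db bases pairs with one three-letter residue on the Qy line. A minimal sketch of that codon-by-codon reading follows. It assumes the standard genetic code (NCBI translation table 1) and a single forward reading frame; the search itself used the `human40.cdi` translation table and a frame-aware model, neither of which is reproduced here, and the helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
    "*": "***",  # stop codon
}

# Map each codon string to its three-letter residue code.
CODON_TABLE = {
    "".join(codon): THREE_LETTER[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 into concatenated three-letter residue codes."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First Db row of RESULT 1 (positions 1-60).
db_row = "ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA"
print(translate(db_row))
```

Translating a Db row this way and comparing it with the Qy row above it is a quick check that an OCR-damaged row has been read correctly.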



RESULT 2 
US-10-297-880-7 

Sequence 7, Application US/10297880 
Publication No. US20030211513A1 
GENERAL INFORMATION: 
APPLICANT: INCYTE GENOMICS, INC.; YUE, Henry 
APPLICANT: HE, Ann; NGUYEN, Danniel B. 
APPLICANT: YAO, Monique G. ; BANDMAN, Olga 
APPLICANT: BURFORD, Neil; TANG, Y. Tom 
APPLICANT: XU, Yuming; HAFALIA, April J. A. 
APPLICANT: AZIMZAI, Yalda; CHAWLA, Narinder K. 
TITLE OF INVENTION: INTRACELLULAR SIGNALING PROTEINS 
FILE REFERENCE: PF-0782 USN 
CURRENT APPLICATION NUMBER: US/10/297,880 
CURRENT FILING DATE: 2002-12-09 
PRIOR APPLICATION NUMBER: PCT/US01/18595
PRIOR FILING DATE: 2001-07-07
PRIOR APPLICATION NUMBER: US 60/210,582
PRIOR FILING DATE: 2000-06-08
PRIOR APPLICATION NUMBER: US 60/212,443
PRIOR FILING DATE: 2000-06-16 
NUMBER OF SEQ ID NOS: 10 
SOFTWARE: PERL Program 
SEQ ID NO 7 
LENGTH: 2976 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature

OTHER INFORMATION: Incyte ID No: 1478005CB1 
US-10-297-880-7 

Alignment Scores:
Pred. No.:              3.92e-177    Length:        2976
Score:                  1909.00      Matches:       342
Percent Similarity:     100.0%       Conservative:  0
Best Local Similarity:  100.0%       Mismatches:    0
Query Match:            100.0%       Indels:        0
DB:                     7            Gaps:          0



US-10-729-895A-2 (1-342) x US-10-297-880-7 (1-2976)

Qy      1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    479 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 538

Qy     21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy     41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    599 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 658

Qy     61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy     81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy    101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy    121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy    141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy    161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy    181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy    201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy    221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy    241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy    261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy    281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378

Qy    301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1379 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1438

Qy    321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1439 [illegible] 1498

Qy    341 ProSer 342
          ||||||
Db   1499 CCCAGC 1504




RESULT 3
US-10-729-895-16





Sequence 16, Application US/10729895 
Publication No. US20060063156A1 
GENERAL INFORMATION: 
APPLICANT: UNIVERSITY OF NEW MEXICO 

TITLE OF INVENTION: OUTCOME PREDICTION AND RISK CLASSIFICATION IN CHILDHOOD 

TITLE OF INVENTION: LEUKEMIA 

FILE REFERENCE: N12-038US/310.00050101

CURRENT APPLICATION NUMBER: US/10/729,895

CURRENT FILING DATE: 2003-12-05 






PRIOR APPLICATION NUMBER: 60/510,904 
PRIOR FILING DATE: 2003-10-14 
PRIOR APPLICATION NUMBER: 60/510,968 
PRIOR FILING DATE: 2003-10-14 
PRIOR APPLICATION NUMBER: 60/432,064 
PRIOR FILING DATE: 2002-12-06 
PRIOR APPLICATION NUMBER: 60/432,077 
PRIOR FILING DATE: 2002-12-06 
PRIOR APPLICATION NUMBER: 60/432,078 
PRIOR FILING DATE: 2002-12-06 
NUMBER OF SEQ ID NOS: 18
SOFTWARE: PatentIn Ver. 3.2
SEQ ID NO 16
LENGTH: 4122
TYPE: DNA
ORGANISM: Homo sapiens
US-10-729-895-16 



Alignment Scores:
Pred. No.:              5.87e-177    Length:        4122
Score:                  1909.00      Matches:       342
Percent Similarity:     100.0%       Conservative:  0
Best Local Similarity:  100.0%       Mismatches:    0
Query Match:            100.0%       Indels:        0
DB:                     12           Gaps:          0



US-10-729-895A-2 (1-342) x US-10-729-895-16 (1-4122)

Qy      1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     21 SerTyrIleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy     41 GluLeuTrpTrpPheTrpLeuValTrpThrIleIleIleIleLeuSerCysCysCysVal 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy     61 CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 80
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy     81 LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 100
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    101 ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    121 ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    141 GlyGlySerProProGlyIleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 160
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    161 LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 [illegible] 804

Qy    221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 [illegible] 864

Qy    241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1045 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1104

Qy    321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy    341 ProSer 342
          ||||||
Db   1165 CCCAGC 1170




RESULT 4
US-10-094-749-823





Sequence 823, Application US/10094749 
Publication No. US20030219741A1
GENERAL INFORMATION: 

APPLICANT: ISOGAI, TAKAO 

APPLICANT: SUGIYAMA, TOMOYASU 

APPLICANT: OTSUKI, TETSUJI 

APPLICANT: WAKAMATSU, AI 

APPLICANT: SATO, HIROYUKI 

APPLICANT: ISHII, SHIZUKO 

APPLICANT: YAMAMOTO, JUN-ICHI 

APPLICANT: ISONO, YUUKO 

APPLICANT: HIO, YURI 

APPLICANT: OTSUKA, KAORU 

APPLICANT: NAGAI, KEIICHI

APPLICANT: IRIE, RYOTARO 

APPLICANT: TAMECHIKA, ICHIRO 

APPLICANT: SEKI, NAOHIKO 

APPLICANT: YOSHIKAWA, TSUTOMU 

APPLICANT: OTSUKA, MOTOYUKI 

APPLICANT: NAGAHARI, KENJI

APPLICANT: MASUHO, YASUHIKO 

TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA 

FILE REFERENCE: 084335/0160 

CURRENT APPLICATION NUMBER: US/10/094,749

CURRENT FILING DATE: 2002-03-12 

PRIOR APPLICATION NUMBER: 60/350,435 

PRIOR FILING DATE: 2002-01-24 

PRIOR APPLICATION NUMBER: JP 2001-328381 

PRIOR FILING DATE: 2001-09-14 

NUMBER OF SEQ ID NOS: 3381 

SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 823 
LENGTH: 2703 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-094-749-823 



Alignment Scores:
Pred. No.:              2.13e-176    Length:        2703
Score:                  1901.00      Matches:       340
Percent Similarity:     99.7%        Conservative:  1
Best Local Similarity:  99.4%        Mismatches:    1
Query Match:            99.6%        Indels:        0
DB:                     7            Gaps:          0



US-10-729-895A-2 (1-342) x US-10-094-749-823 (1-2703) 



Qy 1 MetProPheLeuLeuGlyLeuArgGlnAspLysGluAlaCysValGlyThrAsnAsnGln 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I i I I I I I I I I I I I I I 1 I I 
Db 14 5 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204 



http://es/ScoreAccessWeb/Gefltem.action?AppId=10729895&seqId=1048746&Item 1/22/2007 



Qy 

Db 



21 SerTyrlleCysAspThrGlyHisCysCysGlyGlnSerGlnCysCysAsnTyrTyrTyr 4 0 
I I 1 1 I I I I I I I I I I I I I ! I I I I I I I I I I I I 1 1 I I I I I I I I I I I I II I I I I I I I I II I I I I 
205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264 



Qy 


41 


GluLeuTrpTrpPheTrpLeuValTrpThrllellellelleLeuSerCysCysCysVal 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 I 1 1 If ! 1 1 I ! 




Db 


265 


GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 


324 


Qy 


61 


CysHisHisArgArgAlaLysHisArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsn 


80 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


325 


TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 


384 


Qy 


81 


LeuIleAlaTyrArgGluAlaHisAsnTyrSerAlaLeuProPheTyrPheArgPheLeu 


100 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


385 


CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 


444 


Qy 


101 


ProAsnTyrLeuLeuProProTyrGluGluValValAsnArgProProThrProProPro 


120 






1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


445 


CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 


504 


Qy 


121 


ProTyrSerAlaPheGlnLeuGlnGlnGlnGlnLeuLeuProProGlnCysGlyProAla 


140 






1 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


505 


CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 


564 


Qy 


141 


GlyGlySerProProGlylleAspProThrArgGlySerGlnGlyAlaGlnSerSerPro 


160 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! i 1 1 1 1 




Db 


565 


GGT GGC AGT C CC CC GGGC AT CG AT CC C ACC AGGGGAT C CC AGGGGGC AC AGAGC AGCC CC 


624 


Qy 


1.61 


        LeuSerGluProSerArgSerSerThrArgProProSerIleAlaAspProAspProSer 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy  181 AspLeuProValAspArgAlaAlaThrLysAlaProGlyMetGluProSerGlySerVal 200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy  201 AlaGlyLeuGlyGluLeuAspProGlyAlaPheLeuAspLysAspAlaGluCysArgGlu 220
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy  221 GluLeuLeuLysAspAspSerSerGluHisGlyAlaProAspSerLysGluLysThrPro 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy  241 GlyArgHisArgArgPheThrGlyAspSerGlyIleGluValCysValCysAsnArgGly 260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy  261 HisHisAspAspAspLeuLysGluPheAsnThrLeuIleAspAspAlaLeuAspGlyPro 280
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy  281 LeuAspPheCysAspSerCysHisValArgProProGlyAspGluGluGluGlyLeuCys 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044


Qy  301 GlnSerSerGluGluGlnAlaArgGluProGlyHisProHisLeuProArgProProAla 320
        |||   |||||||||||||||||||||||||||||||||||||||||||||||||||:::
Db 1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104


Qy  321 CysLeuLeuLeuAsnThrIleAsnGluGlnAspSerProAsnSerGlnSerSerSerSer 340
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy  341 ProSer 342
        ||||||
Db 1165 CCCAGC 1170




RESULT 5
US-10-729-895-3
; Sequence 3, Application US/10729895
; Publication No. US20060063156A1
; GENERAL INFORMATION:
; APPLICANT: UNIVERSITY OF NEW MEXICO
; TITLE OF INVENTION: OUTCOME PREDICTION AND RISK CLASSIFICATION IN CHILDHOOD
; TITLE OF INVENTION: LEUKEMIA
; FILE REFERENCE: N12-038US/310.00050101
; CURRENT APPLICATION NUMBER: US/10/729,895
; CURRENT FILING DATE: 2003-12-05
; PRIOR APPLICATION NUMBER: 60/510,904
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/510,968
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/432,064
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,077
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,078
; PRIOR FILING DATE: 2002-12-06
; NUMBER OF SEQ ID NOS: 18
; SOFTWARE: PatentIn Ver. 3.2
; SEQ ID NO 3
; LENGTH: 1140
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (1)..(1089)
US-10-729-895-3



Alignment Scores:
Pred. No.:               1.3e-173   Length:        1140
Score:                   1868.00    Matches:       334
Percent Similarity:      100.0%     Conservative:  0
Best Local Similarity:   100.0%     Mismatches:    0
Query Match:             97.9%      Indels:        0
DB:                      12         Gaps:          0
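
The percentages in a score block can be cross-checked against its raw counts. The sketch below is only a consistency check under two assumptions that the report does not state but that agree with the printed numbers: Query Match is taken as the score divided by the perfect score (1909 for this query, from the run header), and Best Local Similarity as matches divided by aligned positions. The function names are illustrative and are not part of GenCore.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity(matches: int, conservative: int, mismatches: int) -> float:
    """Identical positions as a percentage of all aligned positions."""
    aligned = matches + conservative + mismatches
    return round(100.0 * matches / aligned, 1)


# Result 5: Score 1868.00, 334 matches, no conservative changes or mismatches.
print(query_match(1868.00, 1909))          # 97.9
print(best_local_similarity(334, 0, 0))    # 100.0
```

The same two functions reproduce the 97.6% and 99.7% printed for the hits that score 1863.00 with 333 matches and 1 mismatch.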



US-10-729-895A-2 (1-342) x US-10-729-895-3 (1-1140) 



Qy    9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   88 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 147

Qy   29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  148 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 207

Qy   49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  208 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 267

Qy   69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  268 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 327

Qy   89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  328 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 387

Qy  109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  388 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 447

Qy  129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  448 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 507

Qy  149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  508 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 567

Qy  169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  568 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 627

Qy  189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  628 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 687

Qy  209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  688 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 747

Qy  229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  748 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 807

Qy  249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  808 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 867

Qy  269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  868 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 927

Qy  289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  928 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 987

Qy  309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  988 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1047

Qy  329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
        ||||||||||||||||||||||||||||||||||||||||||
Db 1048 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1089





RESULT 6
US-10-291-172-32
; Sequence 32, Application US/10291172
; Publication No. US20030228584A1
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: Novel Nucleic Acids and Polypeptides
; FILE REFERENCE: 21272-045
; CURRENT APPLICATION NUMBER: US/10/291,172
; CURRENT FILING DATE: 2000-11-08
; PRIOR APPLICATION NUMBER: 09/693,267
; PRIOR FILING DATE: 2000-10-20
; PRIOR APPLICATION NUMBER: 09/665,363
; PRIOR FILING DATE: 2000-09-19
; PRIOR APPLICATION NUMBER: 09/616,847
; PRIOR FILING DATE: 2000-07-14
; PRIOR APPLICATION NUMBER: 09/596,193
; PRIOR FILING DATE: 2000-06-17
; PRIOR APPLICATION NUMBER: 09/574,454
; PRIOR FILING DATE: 2000-05-19
; PRIOR APPLICATION NUMBER: 09/519,705
; PRIOR FILING DATE: 2000-03-07
; NUMBER OF SEQ ID NOS: 752
; SEQ ID NO 32
; LENGTH: 1508
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (87)..(1175)
US-10-291-172-32



Alignment Scores:
Pred. No.:               5.72e-173  Length:        1508
Score:                   1863.00    Matches:       333
Percent Similarity:      99.7%      Conservative:  0
Best Local Similarity:   99.7%      Mismatches:    1
Query Match:             97.6%      Indels:        0
DB:                      7          Gaps:          0



US-10-729-895A-2 (1-342) x US-10-291-172-32 (1-1508) 



Qy    9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy   29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy   49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy   69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy   89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy  109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy  129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy  149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy  169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy  189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy  209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy  229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy  249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  894 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 953

Qy  269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy  289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
        |||||||||||||||||||||||||||||||||||||||   ||||||||||||||||||
Db 1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy  309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy  329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
        ||||||||||||||||||||||||||||||||||||||||||
Db 1134 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGC 1175
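
In a protein-to-nucleic alignment each three-letter residue on the Qy line sits over one codon on the Db line, so a reported mismatch can be located by translating the Db codons and comparing them residue by residue. The sketch below does this for the Qy 289-308 block of this result. It assumes the standard genetic code (the search itself used the translation table named in its command line), and the helper `find_mismatches` is hypothetical, written only for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def find_mismatches(qy: str, db: str, qy_start: int):
    """Return (query position, query residue, codon, translated residue)
    for every codon whose translation differs from the query residue."""
    residues = [qy[i:i + 3] for i in range(0, len(qy), 3)]
    codons = [db[i:i + 3] for i in range(0, len(db), 3)]
    found = []
    for offset, (residue, codon) in enumerate(zip(residues, codons)):
        translated = ONE_TO_THREE[CODON_TABLE[codon]]
        if translated != residue:
            found.append((qy_start + offset, residue, codon, translated))
    return found


qy_block = "ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg"
db_block = "GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA"
print(find_mismatches(qy_block, db_block, 289))
# [(302, 'Ser', 'CCC', 'Pro')]
```

Under these assumptions the single mismatch in the score block falls at query residue 302, where the database codon CCC encodes Pro rather than Ser.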




RESULT 7
US-10-221-278-32
; Sequence 32, Application US/10221278
; Publication No. US20040034208A1
; GENERAL INFORMATION:
; APPLICANT: Hyseq, Inc
; TITLE OF INVENTION: Novel Nucleic Acids and Polypeptides
; FILE REFERENCE: 21272-045
; CURRENT APPLICATION NUMBER: US/10/221,278
; CURRENT FILING DATE: 2002-09-06
; PRIOR APPLICATION NUMBER: 09/693,267
; PRIOR FILING DATE: 2000-10-20
; PRIOR APPLICATION NUMBER: 09/665,363
; PRIOR FILING DATE: 2000-09-19
; PRIOR APPLICATION NUMBER: 09/616,847
; PRIOR FILING DATE: 2000-07-14
; PRIOR APPLICATION NUMBER: 09/596,193
; PRIOR FILING DATE: 2000-06-17
; PRIOR APPLICATION NUMBER: 09/574,454
; PRIOR FILING DATE: 2000-05-19
; PRIOR APPLICATION NUMBER: 09/519,705
; PRIOR FILING DATE: 2000-03-07
; NUMBER OF SEQ ID NOS: 752
; SEQ ID NO 32
; LENGTH: 1508
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (87)..(1175)
US-10-221-278-32



Alignment Scores:
Pred. No.:               5.72e-173  Length:        1508
Score:                   1863.00    Matches:       333
Percent Similarity:      99.7%      Conservative:  0
Best Local Similarity:   99.7%      Mismatches:    1
Query Match:             97.6%      Indels:        0
DB:                      8          Gaps:          0



US-10-729-895A-2 (1-342) x US-10-221-278-32 (1-1508) 



Qy    9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy   29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy   49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy   69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy   89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy  109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy  129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy  149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy  169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy  189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy  209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy  229 GluHisGlyAlaProAspSerLysGluLysThrProGlyArgHisArgArgPheThrGly 248
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy  249 AspSerGlyIleGluValCysValCysAsnArgGlyHisHisAspAspAspLeuLysGlu 268
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  894

Qy  269 PheAsnThrLeuIleAspAspAlaLeuAspGlyProLeuAspPheCysAspSerCysHis 288
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy  289 ValArgProProGlyAspGluGluGluGlyLeuCysGlnSerSerGluGluGlnAlaArg 308
        |||||||||||||||||||||||||||||||||||||||   ||||||||||||||||||
Db 1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy  309 GluProGlyHisProHisLeuProArgProProAlaCysLeuLeuLeuAsnThrIleAsn 328
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy  329 GluGlnAspSerProAsnSerGlnSerSerSerSerProSer 342
        ||||||||||||||||||||||||||||||||||||||||||
Db 1134


RESULT 8
US-10-119-428-35
; Sequence 35, Application US/10119428
; Publication No. US20030165881A1
; GENERAL INFORMATION:
; APPLICANT: Tang, Y. Tom
; APPLICANT: Liu, Chenghua
; APPLICANT: Asundi, Vinod
; APPLICANT: Xu, Chongjun
; APPLICANT: Wehrman, Tom
; APPLICANT: Ren, Feiyan
; APPLICANT: Ma, Yunqing
; APPLICANT: Zhou, Ping
; APPLICANT: Zhao, Qing A.
; APPLICANT: Yang, Yonghong
; APPLICANT: Drmanac, Radoje T.
; TITLE OF INVENTION: Novel Nucleic Acids and
; TITLE OF INVENTION: Polypeptides
; FILE REFERENCE: 789CIP2
; CURRENT APPLICATION NUMBER: US/10/119,428
; CURRENT FILING DATE: 2002-04-09
; PRIOR APPLICATION NUMBER: 09/596,193
; PRIOR FILING DATE: 2000-06-17
; PRIOR APPLICATION NUMBER: 09/574,454
; PRIOR FILING DATE: 2000-05-19
; PRIOR APPLICATION NUMBER: 09/519,705
; PRIOR FILING DATE: 2000-03-07
; NUMBER OF SEQ ID NOS: 55
; SOFTWARE: pt_FL_genes Version 1.0
; SEQ ID NO 35
; LENGTH: 1550
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (129)..(1220)
US-10-119-428-35



Alignment Scores:
Pred. No.:               5.92e-173  Length:        1550
Score:                   1863.00    Matches:       333
Percent Similarity:      99.7%      Conservative:  0
Best Local Similarity:   99.7%      Mismatches:    1
Query Match:             97.6%      Indels:        0
DB:                      7          Gaps:          0



US-10-729-895A-2 (1-342) x US-10-119-428-35 (1-1550) 



Qy    9 GlnAspLysGluAlaCysValGlyThrAsnAsnGlnSerTyrIleCysAspThrGlyHis 28
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  216

Qy   29 CysCysGlyGlnSerGlnCysCysAsnTyrTyrTyrGluLeuTrpTrpPheTrpLeuVal 48
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  276 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 335

Qy   49 TrpThrIleIleIleIleLeuSerCysCysCysValCysHisHisArgArgAlaLysHis 68
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  336 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 395

Qy   69 ArgLeuGlnAlaGlnGlnArgGlnHisGluIleAsnLeuIleAlaTyrArgGluAlaHis 88
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  396 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 455

Qy   89 AsnTyrSerAlaLeuProPheTyrPheArgPheLeuProAsnTyrLeuLeuProProTyr 108
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  456 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 515

Qy  109 GluGluValValAsnArgProProThrProProProProTyrSerAlaPheGlnLeuGln 128
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  516 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 575

Qy  129 GlnGlnGlnLeuLeuProProGlnCysGlyProAlaGlyGlySerProProGlyIleAsp 148
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  576 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 635

Qy  149 ProThrArgGlySerGlnGlyAlaGlnSerSerProLeuSerGluProSerArgSerSer 168
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  636 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 695

Qy  169 ThrArgProProSerIleAlaAspProAspProSerAspLeuProValAspArgAlaAla 188
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  696 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 755

Qy  189 ThrLysAlaProGlyMetGluProSerGlySerValAlaGlyLeuGlyGluLeuAspPro 208
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  756 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 815

Qy  209 GlyAlaPheLeuAspLysAspAlaGluCysArgGluGluLeuLeuLysAspAspSerSer 228



SCORE 1.3 BuildDate: 11/17/2006

SCORE Search Results Details for Application 10729895 and Search Result 20070109_162358_us-10-729-895a-1.rni.



GenCore version 5.1.9
Copyright (c) 1993 - 2007 Biocceleration Ltd.

OM nucleic - nucleic search, using sw model

Run on: January 11, 2007, 01:53:27 ; Search time 245 Seconds
(without alignments)
8248.153 Million cell updates/sec

Title: US-10-729-895A-1
Perfect score: 1080
Sequence: 1 atgcctttccttttgggtct ... aaagcaaccagggtagggga 1080

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 1403666 seqs, 935554401 residues

Total number of hits satisfying chosen parameters: 2807332

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_NA:*
1: /EMC_Celerra_SIDS3/ptodata/2/ina/1_COMB.seq:*
2: /EMC_Celerra_SIDS3/ptodata/2/ina/5_COMB.seq:*
3: /EMC_Celerra_SIDS3/ptodata/2/ina/6A_COMB.seq:*
4: /EMC_Celerra_SIDS3/ptodata/2/ina/6B_COMB.seq:*
5: /EMC_Celerra_SIDS3/ptodata/2/ina/7_COMB.seq:*
6: /EMC_Celerra_SIDS3/ptodata/2/ina/H_COMB.seq:*
7: /EMC_Celerra_SIDS3/ptodata/2/ina/PCTUS_COMB.seq:*
8: /EMC_Celerra_SIDS3/ptodata/2/ina/PP_COMB.seq:*
9: /EMC_Celerra_SIDS3/ptodata/2/ina/RE_COMB.seq:*
10: /EMC_Celerra_SIDS3/ptodata/2/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
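
The throughput and hit-count figures in the run header above can be reproduced from the other header fields. The sketch below assumes, without confirmation from the report, that each database residue is compared against every query position on both strands; that factor of two is inferred from the hit total (2807332) being exactly twice the number of sequences searched (1403666). The function name is illustrative.

```python
def cell_updates_per_second(query_length: int, db_residues: int,
                            seconds: float, strands: int = 2) -> float:
    """Dynamic-programming cells filled per second (strands=2 is an assumption)."""
    return query_length * db_residues * strands / seconds


rate = cell_updates_per_second(1080, 935_554_401, 245)
print(f"{rate / 1e6:.3f} Million cell updates/sec")   # 8248.153 Million cell updates/sec

total_hits = 2 * 1_403_666   # one candidate hit per sequence per strand
print(total_hits)            # 2807332
```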



                         SUMMARIES

                %
Result        Query
 No.   Score  Match  Length  DB  ID                    Description
   1  1076.8   99.7    2703   4  US-10-094-749-823     Sequence 823, App
   2   231.6   21.4     399   3  US-09-513-999C-274    Sequence 274, App
   3   231.6   21.4     399   3  US-09-471-276-82      Sequence 82, Appl
   4   123.2   11.4    2042   4  US-10-094-749-344     Sequence 344, App
   5   122.6   11.4    1292   3  US-09-247-155-182     Sequence 182, App
   6   122.6   11.4    1292   3  US-09-903-190-182     Sequence 182, App
   7      82    7.6     489   3  US-09-513-999C-3746   Sequence 3746, Ap
   8    80.4    7.4     861   3  US-09-247-155-83      Sequence 83, Appl
   9    80.4    7.4     861   3  US-09-903-190-83      Sequence 83, Appl
  10    43.2    4.0    1903   3  US-09-148-545-55      Sequence 55, Appl
  11    43.2    4.0    1903   3  US-09-621-011-55      Sequence 55, Appl
  12    40.6    3.8     519   3  US-09-252-991A-3601   Sequence 3601, Ap
  13    40.6    3.8     939   3  US-09-252-991A-3645   Sequence 3645, Ap
  14    40.6    3.8    1881   3  US-09-252-991A-3697   Sequence 3697, Ap
c 15    40.6    3.8    1998   3  US-09-252-991A-3812   Sequence 3812, Ap
c 16    39.4    3.6     719   3  US-09-620-312D-669    Sequence 669, App
c 17    38.8    3.6     978   3  US-09-252-991A-86     Sequence 86, Appl
c 18    38.8    3.6    1206   3  US-09-252-991A-92     Sequence 92, Appl
c 19      38    3.5    2199   3  US-09-902-540-6388    Sequence 6388, Ap
c 20      38    3.5    2585   3  US-09-902-540-424     Sequence 424, App
c 21    37.6    3.5     480   3  US-09-252-991A-2705   Sequence 2705, Ap
c 22    37.6    3.5     594   3  US-09-252-991A-2890   Sequence 2890, Ap
  23    37.6    3.5     784   3  US-10-017-754-1866    Sequence 1866, Ap
  24    37.6    3.5     785   3  US-10-017-754-1868    Sequence 1868, Ap
  25    37.6    3.5     789   3  US-10-017-754-1867    Sequence 1867, Ap
  26    37.6    3.5     790   3  US-10-017-754-1865    Sequence 1865, Ap
  27    37.6    3.5    1635   3  US-09-234-332-4       Sequence 4, Appli
  28    37.6    3.5    1635   3  US-09-702-705-1798    Sequence 1798, Ap
  29    37.6    3.5    1635   3  US-09-736-457-1798    Sequence 1798, Ap
  30    37.6    3.5    1635   3  US-09-671-325-1798    Sequence 1798, Ap
  31    37.6    3.5    1635   3  US-10-017-754-1798    Sequence 1798, Ap
c 32    37.6    3.5   23901   3  US-09-949-016-16773   Sequence 16773, A
c 33    37.4    3.5    7218   2  US-08-232-463-14      Sequence 14, Appl
c 34    37.2    3.4     505   3  US-09-621-976-15639   Sequence 15639, A
  35    37.2    3.4    1500   2  US-08-487-037-4       Sequence 4, Appli
  36    37.2    3.4    5204   3  US-10-037-417-7       Sequence 7, Appli
  37    37.2    3.4    5640   3  US-09-620-312D-41     Sequence 41, Appl
  38    37.2    3.4    8296   3  US-10-037-182-35      Sequence 35, Appl
  39    37.2    3.4   11350   3  US-10-037-182-1       Sequence 1, Appli
  40      37    3.4    5877   3  US-10-152-886-54      Sequence 54, Appl
c 41    36.8    3.4   12055   3  US-09-949-016-13233   Sequence 13233, A
c 42    36.6    3.4     465   3  US-09-902-540-7880    Sequence 7880, Ap
  43    36.6    3.4     639   3  US-09-252-991A-13260  Sequence 13260, A
c 44    36.6    3.4     942   4  US-09-754-949-1       Sequence 1, Appli
c 45    36.6    3.4     942   5  US-10-417-422-1       Sequence 1, Appli

ALIGNMENTS

RESULT 1
US-10-094-749-823
; Sequence 823, Application US/10094749
; Patent No. 6979557
; GENERAL INFORMATION:
; APPLICANT: ISOGAI, TAKAO
; APPLICANT: SUGIYAMA, TOMOYASU
; APPLICANT: OTSUKI, TETSUJI
; APPLICANT: WAKAMATSU, AI
; APPLICANT: SATO, HIROYUKI
; APPLICANT: ISHII, SHIZUKO
; APPLICANT: YAMAMOTO, JUN-ICHI
; APPLICANT: ISONO, YUUKO
; APPLICANT: HIO, YURI
; APPLICANT: OTSUKA, KAORU
; APPLICANT: NAGAI, KEIICHI
; APPLICANT: IRIE, RYOTARO
; APPLICANT: TAMECHIKA, ICHIRO
; APPLICANT: SEKI, NAOHIKO
; APPLICANT: YOSHIKAWA, TSUTOMU
; APPLICANT: OTSUKA, MOTOYUKI
; APPLICANT: NAGAHARI, KENJI
; APPLICANT: MASUHO, YASUHIKO
; TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA
; FILE REFERENCE: 084335/0160
; CURRENT APPLICATION NUMBER: US/10/094,749
; CURRENT FILING DATE: 2002-03-12
; PRIOR APPLICATION NUMBER: 60/350,435
; PRIOR FILING DATE: 2002-01-24
; PRIOR APPLICATION NUMBER: JP 2001-328381
; PRIOR FILING DATE: 2001-09-14
; NUMBER OF SEQ ID NOS: 3381
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 823
; LENGTH: 2703
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-094-749-823

Query Match 99.7%; Score 1076.8; DB 4; Length 2703;
Best Local Similarity 99.8%; Pred. No. 1.1e-277;
Matches 1078; Conservative 0; Mismatches 2; Indels 0; Gaps 0;



Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db   1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 2 

US-09-513-999C-274 

; Sequence 274, Application US/09513999C
; Patent No. 6783961
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert, A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6783961
; FILE REFERENCE: 59.US2.REG
; CURRENT APPLICATION NUMBER: US/09/513,999C
; CURRENT FILING DATE: 2000-02-24
; PRIOR APPLICATION NUMBER: US 60/122,487
; PRIOR FILING DATE: 1999-02-26
; NUMBER OF SEQ ID NOS: 36681
; SOFTWARE: Patent.pm
; SEQ ID NO 274
;   LENGTH: 399
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: 81..398
;   FEATURE:
;   NAME/KEY: sig_peptide
;   LOCATION: 81..152
;   OTHER INFORMATION: score 10
;   OTHER INFORMATION: seq LLLLQALPSPLSA/RA
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 251
;   OTHER INFORMATION: y=c or t
US-09-513-999C-274 



Query Match 21.4%; Score 231.6; DB 3; Length 399; 

Best Local Similarity 99.6%; Pred. No. 4.6e-52; 

Matches 231; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy     25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    168 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 227

Qy     85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
          |||||||||||||||||||||||:||||||||||||||||||||||||||||||||||||
Db    228 TGCTGTGGACAGTCTCAGTGCTGYAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 287

Qy    145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    288 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 347

Qy    205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAG 256
          ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    348 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAG 399




RESULT 3 








US-09-471-276-82

; Sequence 82, Application US/09471276
; Patent No. 6822072
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6822072
; FILE REFERENCE: GENSET.025CP1
; CURRENT APPLICATION NUMBER: US/09/471,276
; CURRENT FILING DATE: 1999-12-21
; EARLIER APPLICATION NUMBER: 09/057,719
; EARLIER FILING DATE: 1998-04-09
; EARLIER APPLICATION NUMBER: 09/069,047
; EARLIER FILING DATE: 1998-04-28
; EARLIER APPLICATION NUMBER: PCT/IB99/00712
; EARLIER FILING DATE: 1999-04-09
; NUMBER OF SEQ ID NOS: 1622
; SOFTWARE: Patent.pm
; SEQ ID NO 82
;   LENGTH: 399
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: 81..398
;   NAME/KEY: sig_peptide
;   LOCATION: 81..152
;   OTHER INFORMATION: Von Heijne matrix
;   OTHER INFORMATION: score 10
;   OTHER INFORMATION: seq LLLLQALPSPLSA/RA
US-09-471-276-82 



Query Match 21.4%; Score 231.6; DB 3; Length 399; 

Best Local Similarity 99.6%; Pred. No. 4.6e-52; 

Matches 231; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy     25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    168 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 227

Qy     85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
          |||||||||||||||||||||||:||||||||||||||||||||||||||||||||||||
Db    228 TGCTGTGGACAGTCTCAGTGCTGYAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 287

Qy    145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    288 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 347

Qy    205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAG 256
          ||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    348 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAG 399



RESULT 4 

US-10-094-749-344 

; Sequence 344, Application US/10094749
; Patent No. 6979557
; GENERAL INFORMATION:
; APPLICANT: ISOGAI, TAKAO
; APPLICANT: SUGIYAMA, TOMOYASU
; APPLICANT: OTSUKI, TETSUJI
; APPLICANT: WAKAMATSU, AI
; APPLICANT: SATO, HIROYUKI
; APPLICANT: ISHII, SHIZUKO
; APPLICANT: YAMAMOTO, JUN-ICHI
; APPLICANT: ISONO, YUUKO
; APPLICANT: HIO, YURI
; APPLICANT: OTSUKA, KAORU
; APPLICANT: NAGAI, KEIICHI
; APPLICANT: IRIE, RYOTARO
; APPLICANT: TAMECHIKA, ICHIRO
; APPLICANT: SEKI, NAOHIKO
; APPLICANT: YOSHIKAWA, TSUTOMU
; APPLICANT: OTSUKA, MOTOYUKI
; APPLICANT: NAGAHARI, KENJI
; APPLICANT: MASUHO, YASUHIKO
; TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA
; FILE REFERENCE: 084335/0160
; CURRENT APPLICATION NUMBER: US/10/094,749
; CURRENT FILING DATE: 2002-03-12
; PRIOR APPLICATION NUMBER: 60/350,435
; PRIOR FILING DATE: 2002-01-24
; PRIOR APPLICATION NUMBER: JP 2001-328381
; PRIOR FILING DATE: 2001-09-14
; NUMBER OF SEQ ID NOS: 3381
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 344
;   LENGTH: 2042
;   TYPE: DNA
;   ORGANISM: Homo sapiens
US-10-094-749-344 

Query Match 11.4%; Score 123.2; DB 4; Length 2042;

Best Local Similarity 73.1%; Pred. No. 8.5e-23; 

Matches 158; Conservative 0; Mismatches 58; Indels 0; Gaps 0; 



Qy     52 AACAATCAAAGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAAC 111
          ||||| ||   |||| ||||||| |  || |||||||| ||  || ||   ||||||| |
Db   1123 AACAACCAGCCCTACCTCTGTGAGAGTGGTCACTGCTGCGGGGAGACTGGCTGCTGCACC 1182

Qy    112 TACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGC 171
          ||||||||||| |||||||||||||||||| | |||||  || ||||| || | ||||||
Db   1183 TACTACTATGAGCTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTAGCTGC 1242

Qy    172 TGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACAT 231
          || || |  | || |||||| ||||| || | | | || ||    ||||||||||| | |
Db   1243 TGTTGCGCCTTCCGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGCAGCGT 1302

Qy    232 GAAATCAACCTGATCGCTTACCGAGAAGCCCACAAT 267
          ||||||||| || | || || |  |  ||   | ||
Db   1303 GAAATCAACTTGTTGGCCTATCATGGGGCATGCCAT 1338



RESULT 5 

US-09-247-155-182 

; Sequence 182, Application US/09247155A
; Patent No. 6312922
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, Jean-Baptiste
; APPLICANT: Duclert, Aymeric
; APPLICANT: Bougueleret, Lydie
; TITLE OF INVENTION: Complementary DNAs
; FILE REFERENCE: GENSET.021A
; CURRENT APPLICATION NUMBER: US/09/247,155A
; CURRENT FILING DATE: 1999-02-09
; EARLIER APPLICATION NUMBER: 60/074,121
; EARLIER FILING DATE: 1998-02-09
; EARLIER APPLICATION NUMBER: 60/081,563
; EARLIER FILING DATE: 1998-04-13
; EARLIER APPLICATION NUMBER: 60/096,116
; EARLIER FILING DATE: 1998-08-10
; EARLIER APPLICATION NUMBER: 60/099,273
; EARLIER FILING DATE: 1998-10-04
; NUMBER OF SEQ ID NOS: 182
; SOFTWARE: Patent.pm
; SEQ ID NO 182
;   LENGTH: 1292
;   TYPE: DNA
;   ORGANISM: Mus musculus
US-09-247-155-182 

Query Match 11.4%; Score 122.6; DB 3; Length 1292; 

Best Local Similarity 74.2%; Pred. No. 1e-22;

Matches 155; Conservative 0; Mismatches 54; Indels 0; Gaps 0; 



Qy     52 AACAATCAAAGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAAC 111
          || |  ||   |||| ||||||| || || || ||||||||  || ||   ||||||| |
Db    351 AATACCCAGCCCTACCTCTGTGAGACTGGTCATTGCTGTGGGGAGACTGGCTGCTGCACC 410

Qy    112 TACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGC 171
          |||||||||||||||||||||||||||||| | |||||  || ||||| || | ||||||
Db    411 TACTACTATGAACTCTGGTGGTTCTGGCTGCTTTGGACTGTCCTCATCCTCTTTAGCTGC 470

Qy    172 TGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACAT 231
          || ||||  | || ||||||  | || || | | | || ||    || |||||||| | |
Db    471 TGTTGTGCCTTCCGCCACCGAAGGGCTAAACTCAGGCTGCAACAGCAACAGCGGCAGCGT 530

Qy    232 GAAATCAACCTGATCGCTTACCGAGAAGC 260
          ||||||||| || | |||||||  |  ||
Db    531 GAAATCAACTTGTTGGCTTACCATGGGGC 559

RESULT 6

US-09-903-190-182

; Sequence 182, Application US/09903190
; Patent No. 6936692
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, Jean-Baptiste
; APPLICANT: Duclert, Aymeric
; APPLICANT: Bougueleret, Lydie
; TITLE OF INVENTION: Complementary DNAs
; FILE REFERENCE: GENSET.021A
; CURRENT APPLICATION NUMBER: US/09/903,190
; CURRENT FILING DATE: 2001-07-11
; PRIOR APPLICATION NUMBER: US/09/247,155A
; PRIOR FILING DATE: 1999-02-09
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/074,121
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-02-09
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/081,563
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-04-13
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/096,116
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-08-10
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/099,273
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-10-04
; NUMBER OF SEQ ID NOS: 182
; SOFTWARE: Patent.pm
; SEQ ID NO 182
;   LENGTH: 1292
;   TYPE: DNA
;   ORGANISM: Mus musculus
US-09-903-190-182 

Query Match 11.4%; Score 122.6; DB 3; Length 1292; 

Best Local Similarity 74.2%; Pred. No. 1e-22;

Matches 155; Conservative 0; Mismatches 54; Indels 0; Gaps 0; 

Qy     52 AACAATCAAAGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAAC 111
          || |  ||   |||| ||||||| || || || ||||||||  || ||   ||||||| |
Db    351 AATACCCAGCCCTACCTCTGTGAGACTGGTCATTGCTGTGGGGAGACTGGCTGCTGCACC 410

Qy    112 TACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGC 171
          |||||||||||||||||||||||||||||| | |||||  || ||||| || | ||||||
Db    411 TACTACTATGAACTCTGGTGGTTCTGGCTGCTTTGGACTGTCCTCATCCTCTTTAGCTGC 470

Qy    172 TGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACAT 231
          || ||||  | || ||||||  | || || | | | || ||    || |||||||| | |
Db    471 TGTTGTGCCTTCCGCCACCGAAGGGCTAAACTCAGGCTGCAACAGCAACAGCGGCAGCGT 530

Qy    232 GAAATCAACCTGATCGCTTACCGAGAAGC 260
          ||||||||| || | |||||||  |  ||
Db    531 GAAATCAACTTGTTGGCTTACCATGGGGC 559

RESULT 7 

US-09-513-999C-3746 

; Sequence 3746, Application US/09513999C
; Patent No. 6783961
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, J.B.
; APPLICANT: Duclert, A.
; APPLICANT: Giordano, J.Y.
; TITLE OF INVENTION: Expressed Sequence Tags and Encoded Human Proteins.
; Patent No. 6783961
; FILE REFERENCE: 59.US2.REG
; CURRENT APPLICATION NUMBER: US/09/513,999C
; CURRENT FILING DATE: 2000-02-24
; PRIOR APPLICATION NUMBER: US 60/122,487
; PRIOR FILING DATE: 1999-02-26
; NUMBER OF SEQ ID NOS: 36681
; SOFTWARE: Patent.pm
; SEQ ID NO 3746
;   LENGTH: 489
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: 126..455
;   FEATURE:
;   NAME/KEY: sig_peptide
;   LOCATION: 126..233
;   OTHER INFORMATION: score 9.7
;   OTHER INFORMATION: seq LWTVLILFSCCCA/FX
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 237
;   OTHER INFORMATION: n=a, g, c or t
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 281
;   OTHER INFORMATION: s=g or c
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 361
;   OTHER INFORMATION: s=g or c
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 378
;   OTHER INFORMATION: n=a, g, c or t
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 406
;   OTHER INFORMATION: m=a or c
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 410
;   OTHER INFORMATION: n=a, g, c or t
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 411
;   OTHER INFORMATION: n=a, g, c or t
;   FEATURE:
;   NAME/KEY: misc_feature
;   LOCATION: 422
;   OTHER INFORMATION: m=a or c
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 2
;   OTHER INFORMATION: Xaa=Cys or Gly or Arg or Ser
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 16
;   OTHER INFORMATION: Xaa=His or Gln
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 43
;   OTHER INFORMATION: Xaa=Ala or Gly
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 49
;   OTHER INFORMATION: Xaa=Cys or Gly or Arg or Ser
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 58
;   OTHER INFORMATION: Xaa=Asn or Thr
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 60
;   OTHER INFORMATION: Xaa= * or Glu or Lys or Gln
;   FEATURE:
;   NAME/KEY: UNSURE
;   LOCATION: 63
;   OTHER INFORMATION: Xaa=His or Gln
US-09-513-999C-3746

Query Match 7.6%; Score 82; DB 3; Length 489;

Best Local Similarity 72.7%; Pred. No. 4.9e-12;

Matches 117; Conservative 1; Mismatches 42; Indels 1; Gaps 1;



Qy    107 GCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGA 166
          ||| |||||||||||| |||||||||||||||||| | |||||  || ||||| || | |
Db    160 GCACCTACTACTATGAGCTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTA 219

Qy    167 GCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGC 226
          ||||||| || |  | |  |||||| ||||| || | | | || ||    ||||||||||
Db    220 GCTGCTGTTGCGCCTTCNGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGC 279

Qy    227 AACATGAAATCAACCTGATCGCTTACCGAGAAGCCCACAAT 267
           |: |||||||||| || | || || |  |  ||   | ||
Db    280 -ASGTGAAATCAACTTGTTGGCCTATCATGGGGCATGCCAT 319



RESULT 8 

US-09-247-155-83 

; Sequence 83, Application US/09247155A
; Patent No. 6312922
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, Jean-Baptiste
; APPLICANT: Duclert, Aymeric
; APPLICANT: Bougueleret, Lydie
; TITLE OF INVENTION: Complementary DNAs
; FILE REFERENCE: GENSET.021A
; CURRENT APPLICATION NUMBER: US/09/247,155A
; CURRENT FILING DATE: 1999-02-09
; EARLIER APPLICATION NUMBER: 60/074,121
; EARLIER FILING DATE: 1998-02-09
; EARLIER APPLICATION NUMBER: 60/081,563
; EARLIER FILING DATE: 1998-04-13
; EARLIER APPLICATION NUMBER: 60/096,116
; EARLIER FILING DATE: 1998-08-10
; EARLIER APPLICATION NUMBER: 60/099,273
; EARLIER FILING DATE: 1998-10-04
; NUMBER OF SEQ ID NOS: 182
; SOFTWARE: Patent.pm
; SEQ ID NO 83
;   LENGTH: 861
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: 612..644
;   FEATURE:
;   NAME/KEY: polyA_signal
;   LOCATION: 829..834
;   FEATURE:
;   NAME/KEY: polyA_site
;   LOCATION: 850..861
US-09-247-155-83 

Query Match 7.4%; Score 80.4; DB 3; Length 861; 

Best Local Similarity 71.9%; Pred. No. 1.7e-11;

Matches 105; Conservative 0; Mismatches 41; Indels 0; Gaps 0; 

Qy    122 AACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTT 181
          | |||||||||||||||||| | |||||  || ||||| || | |||||||| || |  |
Db      1 AGCTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTAGCTGCTGTTGCGCCT 60

Qy    182 GCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACC 241
           || |||||| ||||| || | | | || ||    ||||||||||| | |||||||||| 
Db     61 TCCGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGCAGCGTGAAATCAACT 120

Qy    242 TGATCGCTTACCGAGAAGCCCACAAT 267
          || | || || |  |  ||   | ||
Db    121 TGTTGGCCTATCATGGGGCATGCCAT 146



RESULT 9 

US-09-903-190-83 

; Sequence 83, Application US/09903190
; Patent No. 6936692
; GENERAL INFORMATION:
; APPLICANT: Dumas Milne Edwards, Jean-Baptiste
; APPLICANT: Duclert, Aymeric
; APPLICANT: Bougueleret, Lydie
; TITLE OF INVENTION: Complementary DNAs
; FILE REFERENCE: GENSET.021A
; CURRENT APPLICATION NUMBER: US/09/903,190
; CURRENT FILING DATE: 2001-07-11
; PRIOR APPLICATION NUMBER: US/09/247,155A
; PRIOR FILING DATE: 1999-02-09
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/074,121
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-02-09
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/081,563
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-04-13
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/096,116
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-08-10
; PRIOR APPLICATION NUMBER: EARLIER APPLICATION NUMBER: 60/099,273
; PRIOR FILING DATE: EARLIER FILING DATE: 1998-10-04
; NUMBER OF SEQ ID NOS: 182
; SOFTWARE: Patent.pm
; SEQ ID NO 83
;   LENGTH: 861
;   TYPE: DNA
;   ORGANISM: Homo sapiens
;   FEATURE:
;   NAME/KEY: CDS
;   LOCATION: 612..644
;   FEATURE:
;   NAME/KEY: polyA_signal
;   LOCATION: 829..834
;   FEATURE:
;   NAME/KEY: polyA_site
;   LOCATION: 850..861
US-09-903-190-83 

Query Match 7.4%; Score 80.4; DB 3; Length 861;

Best Local Similarity 71.9%; Pred. No. 1.7e-11;

Matches 105; Conservative 0; Mismatches 41; Indels 0; Gaps 0;

Qy    122 AACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTT 181
          | |||||||||||||||||| | |||||  || ||||| || | |||||||| || |  |
Db      1 AGCTCTGGTGGTTCTGGCTGCTCTGGACTGTCCTCATCCTCTTTAGCTGCTGTTGCGCCT 60

Qy    182 GCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACC 241
           || |||||| ||||| || | | | || ||    ||||||||||| | |||||||||| 
Db     61 TCCGCCACCGACGAGCTAAACTCAGGCTGCAACAACAGCAGCGGCAGCGTGAAATCAACT 120

Qy    242 TGATCGCTTACCGAGAAGCCCACAAT 267
          || | || || |  |  ||   | ||
Db    121 TGTTGGCCTATCATGGGGCATGCCAT 146



RESULT 10 
US-09-148-545-55 

; Sequence 55, Application US/09148545
; Patent No. 6590075
; GENERAL INFORMATION:
; APPLICANT: Rosen et al.
; TITLE OF INVENTION: 70 Human Secreted Proteins
; FILE REFERENCE: PZ001P1
; CURRENT APPLICATION NUMBER: US/09/148,545
; CURRENT FILING DATE: 1998-09-04
; EARLIER APPLICATION NUMBER: PCT/US98/04482
; EARLIER FILING DATE: 1998-03-06
; EARLIER APPLICATION NUMBER: 60/040,162
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,333
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/038,621
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,161
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,626
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,334
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,336
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/040,163
; EARLIER FILING DATE: 1997-03-07
; EARLIER APPLICATION NUMBER: 60/047,615
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,600
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,597
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,502
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,633
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,583
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,617
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,618
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,503
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,592
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,581
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,584
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,500
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,587
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,492
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,598
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,613
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,582
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,596
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,612
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,632
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,601
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/043,580
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,568
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,314
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,569
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,311
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,671
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,674
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,669
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,312
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,313
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,672
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,315
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/048,974
; EARLIER FILING DATE: 1997-06-06
; EARLIER APPLICATION NUMBER: 60/056,886
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,877
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,889
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,893
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,630
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,878
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,662
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,872
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,882
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,637
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,903
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,888
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,879
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,880
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,894
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,911
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,636
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,874
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,910
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,864
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,631
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,845
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,892
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/047,595
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/057,761
; EARLIER FILING DATE: 05-Sep-1997
; EARLIER APPLICATION NUMBER: 60/047,599
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,588
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,585
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,586
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,590
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,594
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,589
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,593
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/047,614
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/043,578
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/043,576
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/047,501
; EARLIER FILING DATE: 1997-05-23
; EARLIER APPLICATION NUMBER: 60/043,670
; EARLIER FILING DATE: 1997-04-11
; EARLIER APPLICATION NUMBER: 60/056,632
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,664
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,876
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,881
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,909
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,875
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,862
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,887
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/056,908
; EARLIER FILING DATE: 1997-08-22
; EARLIER APPLICATION NUMBER: 60/048,964
; EARLIER FILING DATE: 1997-06-06
; EARLIER APPLICATION NUMBER: 60/057,650
; EARLIER FILING DATE: 1997-09-05
; EARLIER APPLICATION NUMBER: 60/056,884
; EARLIER FILING DATE: 1997-08-22
; NUMBER OF SEQ ID NOS: 280
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 55
;   LENGTH: 1903

Query Match 4.0%; Score 43.2; DB 3; Length 1903; 

Best Local Similarity 58.6%; Pred. No. 0.2; 

Matches 75; Conservative 0; Mismatches 53; Indels 0; Gaps 0; 

Qy    845 ACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGT 904
          |||||||| || || | |    |||||| |||| |   |  |||||||| | | | |   
Db     54 ACTTCTGCCACCGCGGGCCACCGCGGCCGCCTGATCCCGCAGAGGAAGGTCGCGGCCGTG 113

Qy    905 CCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCC 964
             |   |   |  |   |   |   |||| ||||   ||||||| ||||||  |    |
Db    114 GAGCGATGACCCGCGGCGGTCCGGGCGGGCGCCCGGGGCTGCCACAGCCGCCGCCGCTTC 173

Qy    965 TGCTGCTG 972
          ||||||||
Db    174 TGCTGCTG 181



RESULT 11 
US-09-621-011-55 

; Sequence 55, Application US/09621011 

; Patent No. 6878687 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 70 Human Secreted Proteins
; FILE REFERENCE: PZ001P1 

; CURRENT APPLICATION NUMBER: US/09/621,011 
; CURRENT FILING DATE: 2000-07-20 

Prior application data removed - consult PALM or file wrapper 
; NUMBER OF SEQ ID NOS: 280 
; SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 55 

LENGTH: 1903 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-621-011-55 

Query Match 4.0%; Score 43.2; DB 3; Length 1903; 

Best Local Similarity 58.6%; Pred. No. 0.2; 

Matches 75; Conservative 0; Mismatches 53; Indels 0; Gaps 0; 



Qy    845 ACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGT 904
          |||||||| || || | |    |||||| |||| |   |  |||||||| | | | |
Db     54 ACTTCTGCCACCGCGGGCCACCGCGGCCGCCTGATCCCGCAGAGGAAGGTCGCGGCCGTG 113

Qy    905 CCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCC 964
             |   |   |  |   |   |   |||| ||||   ||||||| ||||||  |    |
Db    114 GAGCGATGACCCGCGGCGGTCCGGGCGGGCGCCCGGGGCTGCCACAGCCGCCGCCGCTTC 173

Qy    965 TGCTGCTG 972
          ||||||||
Db    174 TGCTGCTG 181
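
The marker row between each Qy and Db line can be regenerated from the two sequence rows alone. The sketch below assumes an ungapped block (Indels 0, Gaps 0, as stated for this record) and places a `|` under every identical position; the function name `match_line` is illustrative and not part of the GenCore output.

```python
def match_line(query: str, subject: str) -> str:
    """Return '|' under identical positions and ' ' under mismatches."""
    if len(query) != len(subject):
        raise ValueError("ungapped rows must have equal length")
    return "".join("|" if q == s else " " for q, s in zip(query, subject))


# First block of the record above: Qy 845-904 against Db 54-113.
qy = "ACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGT"
db = "ACTTCTGCCACCGCGGGCCACCGCGGCCGCCTGATCCCGCAGAGGAAGGTCGCGGCCGTG"

markers = match_line(qy, db)
print(f"Qy {845:>6} {qy} 904")
print(f"{'':10}{markers.rstrip()}")
print(f"Db {54:>6} {db} 113")
print("identical positions in this block:", markers.count("|"))
```

Summing the `|` counts over the three blocks of this record gives the "Matches 75" figure reported in its statistics line.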



RESULT 12 

US-09-252-991A-3601 

; Sequence 3601, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al. 
; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO PSEUDOMONAS

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 

; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/09/252,991A

; CURRENT FILING DATE: 1999-02-18 

; PRIOR APPLICATION NUMBER: US 60/074,788 

; PRIOR FILING DATE: 1998-02-18 

; PRIOR APPLICATION NUMBER: US 60/094,190 

; PRIOR FILING DATE: 1998-07-27 

; NUMBER OF SEQ ID NOS: 33142 

; SEQ ID NO 3601 

LENGTH: 519 

TYPE: DNA 

ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-3601 

Query Match 3.8%; Score 40.6; DB 3; Length 519; 

Best Local Similarity 53.5%; Pred. No. 0.59; 



Matches 85; Conservative 0; Mismatches 74; Indels 0; Gaps 0; 

Qy    827 CTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGG 886
          | || |  |  | ||| |||  | |||||| | |||   ||| ||  |     || |  |
Db     85 CCCTCGGCGAACGCCTCGACAGCGGCGACATCCGCCGCCTGCTGCAACGCCAGGACGCCG 144

Qy    887 AGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGC 946
          | ||     || | ||||      |||||| |      |  |  | || || | ||| ||
Db    145 ACGACCCGATCCGCCAGTTGCACCAGGAGCTGCGCGAGGGCCTCGCGCGCCTGGACCCGC 204

Qy    947 CACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACG 985
              || ||||| | ||    ||||| || ||  |  ||
Db    205 GCGCGCTGCCCGGACGCGCCGTGCTGGACGCCTACGTCG 243
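
The "Best Local Similarity" percentages in these records are consistent with dividing the number of matches by the number of aligned columns. The sketch below assumes that definition (the report itself does not state it) and counts columns as matches + mismatches + indels. Because every record shown here has Indels 0, only the ungapped case is checked against listed values.

```python
def best_local_similarity(matches: int, mismatches: int, indels: int = 0) -> float:
    """Percent of aligned columns that are identical (assumed definition)."""
    columns = matches + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / columns, 1)


# Values taken from the statistics lines of the records above.
print(best_local_similarity(75, 53))   # record with Matches 75; Mismatches 53
print(best_local_similarity(85, 74))   # record with Matches 85; Mismatches 74
```

These reproduce the 58.6% and 53.5% figures printed for the two records.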
SCORE Search Results Details for Application 10729895 and Search Result 20070109_162402_us-10-729-895a-1.rnpbm.




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: January 11, 2007, 02:09:39 ; Search time 1616 Seconds 

(without alignments) 
8212.033 Million cell updates/sec 

Title: US-10-729-895A-1 
Perfect score: 1080 

Sequence: 1 atgcctttccttttgggtct aaagcaaccagggtagggga 1080 



Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       18892170 seqs, 6143817638 residues

Total number of hits satisfying chosen parameters:  37784340

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
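
The throughput figure in the run header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one query-residue by database-residue comparison and that both strands of each database sequence were searched. The strand factor of 2 is an assumption; it is suggested by the hit total (37784340) being exactly twice the number of sequences searched (18892170).

```python
QUERY_LENGTH = 1080          # length of query US-10-729-895A-1
DB_RESIDUES = 6_143_817_638  # "Searched: ... residues"
DB_SEQS = 18_892_170         # "Searched: ... seqs"
SEARCH_SECONDS = 1616        # "Search time 1616 Seconds"
STRANDS = 2                  # assumption: forward and complement strands


def million_cell_updates_per_sec(query_len: int, residues: int,
                                 seconds: float, strands: int) -> float:
    """Dynamic-programming cells filled per second, in millions."""
    cells = query_len * residues * strands
    return cells / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES,
                                    SEARCH_SECONDS, STRANDS)
print(f"{rate:.3f} Million cell updates/sec")
print("hits considered:", DB_SEQS * STRANDS)
```

Under these assumptions the computed rate agrees with the 8212.033 Million cell updates/sec printed in the header.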



Database :      Published_Applications_NA_Main:*
                1:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                3:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                4:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                5:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                6:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                7:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                8:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
                9:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
                10:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
                11:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10F_PUBCOMB.seq:*
                12:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB.seq:*
                13:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11A_PUBCOMB.seq:*
                14:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11B_PUBCOMB.seq:*
                15:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11C_PUBCOMB.seq:*
                16:  /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11D_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



                                SUMMARIES

                   %
 Result          Query
   No.   Score   Match  Length  DB  ID                     Description
------------------------------------------------------------------------------
    1    1080  100.0    1080  12  US-10-729-895-1        Sequence 1, Appli
    2    1080  100.0    2976   7  US-10-297-880-7        Sequence 7, Appli
    3    1080  100.0    4122  12  US-10-729-895-16       Sequence 16, Appl
    4  1076.8   99.7    2703   7  US-10-094-749-823      Sequence 823, App
    5  1054.4   97.6    1508   7  US-10-291-172-32       Sequence 32, Appl
    6  1054.4   97.6    1508   8  US-10-221-278-32       Sequence 32, Appl
    7  1054.4   97.6    1550   7  US-10-119-428-35       Sequence 35, Appl
    8  1054.4   97.6    3095   3  US-09-814-353-20113    Sequence 20113, A
    9    1053   97.5    1140  12  US-10-729-895-3        Sequence 3, Appli
   10  1035.2   95.9    1715   7  US-10-291-172-408      Sequence 408, App
   11  1035.2   95.9    1715   8  US-10-221-278-408      Sequence 408, App
   12   783.2   72.5   69081   6  US-10-087-192-1192     Sequence 1192, Ap
c  13     505   46.8     505   3  US-09-864-761-8443     Sequence 8443, Ap
c  14   430.2   39.8  684973   3  US-09-263-959-1        Sequence 1, Appli
   15     423   39.2     439   3  US-09-814-353-14563    Sequence 14563, A
   16   402.4   37.3     566   4  US-09-925-065A-842189  Sequence 842189,
   17   402.4   37.3     566   5  US-09-925-065A-842189  Sequence 842189,
c  18     352   32.6     352   3  US-09-864-761-25171    Sequence 25171, A
   19     352   32.6     382   3  US-09-814-353-1832     Sequence 1832, Ap
   20     352   32.6     382   3  US-09-814-353-8179     Sequence 8179, Ap
   21     293   27.1     462   3  US-09-918-995-9876     Sequence 9876, Ap
   22   231.6   21.4     399  10  US-10-926-683-82       Sequence 82, Appl
c  23   225.2   20.9     558  12  US-10-301-480-503459   Sequence 503459,
c  24   225.2   20.9     558  12  US-10-301-480-1116868  Sequence 1116868,
c  25     224   20.7     528  12  US-10-301-480-503460   Sequence 503460,
c  26     224   20.7     528  12  US-10-301-480-1116869  Sequence 1116869,
c  27     224   20.7     568   4  US-09-925-065A-442874  Sequence 442874,
c  28     224   20.7     568   5  US-09-925-065A-442874  Sequence 442874,
c  29   223.6   20.7     569   4  US-09-925-065A-442873  Sequence 442873,
c  30   223.6   20.7     569   5  US-09-925-065A-442873  Sequence 442873,
   31     223   20.6     553   4  US-09-925-065A-409698  Sequence 409698,
   32     223   20.6     553   5  US-09-925-065A-409698  Sequence 409698,
c  33   222.8   20.6     562   4  US-09-925-065A-125966  Sequence 125966,
c  34   222.8   20.6     562   5  US-09-925-065A-125966  Sequence 125966,
c  35   222.8   20.6     571  12  US-10-301-480-223349   Sequence 223349,
c  36   222.8   20.6     571  12  US-10-301-480-836758   Sequence 836758,
   37   222.4   20.6     528   4  US-09-925-065A-393320  Sequence 393320,
   38   222.4   20.6     528   4  US-09-925-065A-393321  Sequence 393321,
   39   222.4   20.6     528   5  US-09-925-065A-393320  Sequence 393320,
   40   222.4   20.6     528   5  US-09-925-065A-393321  Sequence 393321,
   41   222.4   20.6     561  12  US-10-301-480-461406   Sequence 461406,
   42   222.4   20.6     561  12  US-10-301-480-461407   Sequence 461407,
   43   222.4   20.6     561  12  US-10-301-480-1074815  Sequence 1074815,
   44   222.4   20.6     561  12  US-10-301-480-1074816  Sequence 1074816,
   45   221.2   20.5     528   4  US-09-925-065A-393412  Sequence 393412,

ALIGNMENTS 



RESULT 1
US-10-729-895-1
; Sequence 1, Application US/10729895
; Publication No. US20060063156A1
; GENERAL INFORMATION:
; APPLICANT: UNIVERSITY OF NEW MEXICO
; TITLE OF INVENTION: OUTCOME PREDICTION AND RISK CLASSIFICATION IN CHILDHOOD
; TITLE OF INVENTION: LEUKEMIA
; FILE REFERENCE: N12-038US/310.00050101
; CURRENT APPLICATION NUMBER: US/10/729,895
; CURRENT FILING DATE: 2003-12-05
; PRIOR APPLICATION NUMBER: 60/510,904
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/510,968
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/432,064
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,077
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,078
; PRIOR FILING DATE: 2002-12-06
; NUMBER OF SEQ ID NOS: 18
; SOFTWARE: PatentIn Ver. 3.2
; SEQ ID NO 1
; LENGTH: 1080
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: CDS
; LOCATION: (1)..(1026)
US-10-729-895-1

Query Match 100.0%; Score 1080; DB 12; Length 1080;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0;


Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080



RESULT 2
US-10-297-880-7
; Sequence 7, Application US/10297880
; Publication No. US20030211513A1
; GENERAL INFORMATION:
; APPLICANT: INCYTE GENOMICS, INC.; YUE, Henry
; APPLICANT: HE, Ann; NGUYEN, Danniel B.
; APPLICANT: YAO, Monique G.; BANDMAN, Olga
; APPLICANT: BURFORD, Neil; TANG, Y. Tom
; APPLICANT: XU, Yuming; HAFALIA, April J. A.
; APPLICANT: AZIMZAI, Yalda; CHAWLA, Narinder K.
; TITLE OF INVENTION: INTRACELLULAR SIGNALING PROTEINS
; FILE REFERENCE: PF-0782 USN
; CURRENT APPLICATION NUMBER: US/10/297,880
; CURRENT FILING DATE: 2002-12-09
; PRIOR APPLICATION NUMBER: PCT/US01/18595
; PRIOR FILING DATE: 2001-07-07
; PRIOR APPLICATION NUMBER: US 60/210,582
; PRIOR FILING DATE: 2000-06-08
; PRIOR APPLICATION NUMBER: US 60/212,443
; PRIOR FILING DATE: 2000-06-16
; NUMBER OF SEQ ID NOS: 10
; SOFTWARE: PERL Program
; SEQ ID NO 7
; LENGTH: 2976
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: misc_feature
; OTHER INFORMATION: Incyte ID No: 1478005CB1
US-10-297-880-7

Query Match 100.0%; Score 1080; DB 7; Length 2976;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    479 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 538

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    599 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 658

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1379 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1438

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1439 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1498

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1499 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1558



RESULT 3
US-10-729-895-16
; Sequence 16, Application US/10729895
; Publication No. US20060063156A1
; GENERAL INFORMATION:
; APPLICANT: UNIVERSITY OF NEW MEXICO
; TITLE OF INVENTION: OUTCOME PREDICTION AND RISK CLASSIFICATION IN CHILDHOOD
; TITLE OF INVENTION: LEUKEMIA
; FILE REFERENCE: N12-038US/310.00050101
; CURRENT APPLICATION NUMBER: US/10/729,895
; CURRENT FILING DATE: 2003-12-05
; PRIOR APPLICATION NUMBER: 60/510,904
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/510,968
; PRIOR FILING DATE: 2003-10-14
; PRIOR APPLICATION NUMBER: 60/432,064
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,077
; PRIOR FILING DATE: 2002-12-06
; PRIOR APPLICATION NUMBER: 60/432,078
; PRIOR FILING DATE: 2002-12-06
; NUMBER OF SEQ ID NOS: 18
; SOFTWARE: PatentIn Ver. 3.2
; SEQ ID NO 16
; LENGTH: 4122
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-729-895-16

Query Match 100.0%; Score 1080; DB 12; Length 4122;
Best Local Similarity 100.0%; Pred. No. 0;
Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1045 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1104

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 4
US-10-094-749-823
; Sequence 823, Application US/10094749
; Publication No. US20030219741A1
; GENERAL INFORMATION:
; APPLICANT: ISOGAI, TAKAO
; APPLICANT: SUGIYAMA, TOMOYASU
; APPLICANT: OTSUKI, TETSUJI
; APPLICANT: WAKAMATSU, AI
; APPLICANT: SATO, HIROYUKI
; APPLICANT: ISHII, SHIZUKO
; APPLICANT: YAMAMOTO, JUN-ICHI
; APPLICANT: ISONO, YUUKO
; APPLICANT: HIO, YURI
; APPLICANT: OTSUKA, KAORU
; APPLICANT: NAGAI, KEIICHI
; APPLICANT: IRIE, RYOTARO
; APPLICANT: TAMECHIKA, ICHIRO
; APPLICANT: SEKI, NAOHIKO
; APPLICANT: YOSHIKAWA, TSUTOMU
; APPLICANT: OTSUKA, MOTOYUKI
; APPLICANT: NAGAHARI, KENJI
; APPLICANT: MASUHO, YASUHIKO
; TITLE OF INVENTION: NOVEL FULL-LENGTH cDNA
; FILE REFERENCE: 084335/0160
; CURRENT APPLICATION NUMBER: US/10/094,749
; CURRENT FILING DATE: 2002-03-12
; PRIOR APPLICATION NUMBER: 60/350,435
; PRIOR FILING DATE: 2002-01-24
; PRIOR APPLICATION NUMBER: JP 2001-328381
; PRIOR FILING DATE: 2001-09-14
; NUMBER OF SEQ ID NOS: 3381
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 823
; LENGTH: 2703
; TYPE: DNA
; ORGANISM: Homo sapiens
US-10-094-749-823

Query Match 99.7%; Score 1076.8; DB 7; Length 2703;
Best Local Similarity 99.8%; Pred. No. 0;
Matches 1078; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db   1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224


RESULT 5
US-10-291-172-32
; Sequence 32, Application US/10291172
; Publication No. US20030228584A1



; GENERAL INFORMATION: 
; APPLICANT: Hyseq, Inc 

; TITLE OF INVENTION: Novel Nucleic Acids and Polypeptides
; FILE REFERENCE: 21272-045

; CURRENT APPLICATION NUMBER: US/10/291,172

; CURRENT FILING DATE: 2000-11-08 

; PRIOR APPLICATION NUMBER: 09/693,267 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 09/665,363 

; PRIOR FILING DATE: 2000-09-19 

; PRIOR APPLICATION NUMBER: 09/616,847 

; PRIOR FILING DATE: 2000-07-14 

; PRIOR APPLICATION NUMBER: 09/596,193 

; PRIOR FILING DATE: 2000-06-17 

; PRIOR APPLICATION NUMBER: 09/574,454

; PRIOR FILING DATE: 2000-05-19 

; PRIOR APPLICATION NUMBER: 09/519,705 

; PRIOR FILING DATE: 2000-03-07 

; NUMBER OF SEQ ID NOS: 752

; SEQ ID NO 32 

LENGTH: 1508 

TYPE: DNA 

ORGANISM: Homo sapiens
FEATURE:
NAME/KEY: CDS
LOCATION: (87)..(1175)
US-10-291-172-32 

Query Match 97.6%; Score 1054.4; DB 7; Length 1508; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
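
The summary figures above can be cross-checked with a little arithmetic. The sketch below is not GenCore's documented definition. It assumes that Query Match is the hit score as a percentage of the perfect score, that the perfect score for this 1080-nt query is 1080 (the value reported for query US-10-729-895A-1 in the nucleic-nucleic search header of this report), and that Best Local Similarity is matches divided by aligned columns. Under those assumptions the printed percentages are reproduced.

```python
# Figures copied from the RESULT 5 statistics block.
score = 1054.4
matches = 1055
mismatches = 1
indels = 0

# Assumption: perfect score for the 1080-nt query, taken from the
# nucleic-nucleic search header for US-10-729-895A-1.
perfect_score = 1080

# Assumption: Query Match = score as a percentage of the perfect score.
query_match = 100 * score / perfect_score

# Assumption: Best Local Similarity = matches / aligned columns.
aligned_columns = matches + mismatches + indels
best_local_similarity = 100 * matches / aligned_columns

print(f"Query Match           {query_match:.1f}%")
print(f"Best Local Similarity {best_local_similarity:.1f}%")
```

The aligned span also agrees with the coordinates: query positions 25 to 1080 cover 1056 columns, which equals 1055 matches plus 1 mismatch.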

Qy  25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy  85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy 145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy 205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy 265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy 325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy 385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy 445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy 505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy 565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy 625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy 685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy 745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 894 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 953

Qy 805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy  865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
        ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy  925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy  985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1134 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1193

Qy 1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||
Db 1194 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1229
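
The middle row of each alignment block marks identical positions between the query (Qy) and database (Db) rows. As a minimal sketch, assuming an ungapped block and using hypothetical helper names (`match_line`, `mismatch_positions`) that are not part of GenCore, the row for query positions 865 to 924 of this result can be regenerated and the single reported mismatch located:

```python
def match_line(query: str, subject: str) -> str:
    """Return '|' under identical residues and a space elsewhere (ungapped rows)."""
    if len(query) != len(subject):
        raise ValueError("ungapped rows must have equal length")
    return "".join("|" if q == s else " " for q, s in zip(query, subject))


def mismatch_positions(query: str, subject: str, query_start: int) -> list:
    """Return 1-based query coordinates at which the two rows differ."""
    return [query_start + i
            for i, (q, s) in enumerate(zip(query, subject)) if q != s]


# Rows copied from the block "Qy 865 ... 924" / "Db 1014 ... 1073",
# split into 10-mers so they are easy to compare by eye.
qy = ("GTGCGGCCCC" "CTGGTGATGA" "GGAGGAAGGC"
      "CTCTGTCAGT" "CCTCTGAGGA" "GCAGGCTCGA")
db = ("GTGCGGCCCC" "CTGGTGATGA" "GGAGGAAGGC"
      "CTCTGTCAGC" "CCTCTGAGGA" "GCAGGCTCGA")

line = match_line(qy, db)
positions = mismatch_positions(qy, db, query_start=865)

print("Qy", qy)
print("  ", line)
print("Db", db)
print("mismatch at query position(s):", positions)
```

With the rows as transcribed here, the only difference falls at query position 904 (T in the query, C in the database sequence), consistent with the "Mismatches 1" entry in the statistics.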



RESULT 6 

US-10-221-278-32 

; Sequence 32, Application US/10221278 
; Publication No. US20040034208A1
; GENERAL INFORMATION: 
; APPLICANT: Hyseq, Inc 

; TITLE OF INVENTION: Novel Nucleic Acids and Polypeptides
; FILE REFERENCE: 21272-045 

; CURRENT APPLICATION NUMBER: US/10/221,278

; CURRENT FILING DATE: 2002-09-06

; PRIOR APPLICATION NUMBER: 09/693,267 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 09/665,363 

; PRIOR FILING DATE: 2000-09-19 

; PRIOR APPLICATION NUMBER: 09/616,847 

; PRIOR FILING DATE: 2000-07-14 

; PRIOR APPLICATION NUMBER: 09/596,193 

; PRIOR FILING DATE: 2000-06-17 

; PRIOR APPLICATION NUMBER: 09/574,454 

; PRIOR FILING DATE: 2000-05-19 

; PRIOR APPLICATION NUMBER: 09/519,705 

; PRIOR FILING DATE: 2000-03-07 

; NUMBER OF SEQ ID NOS: 752

; SEQ ID NO 32 

LENGTH: 1508 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (87)..(1175)
US-10-221-278-32 

Query Match 97.6%; Score 1054.4; DB 8; Length 1508; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy  25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy  85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy 145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy 205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy 265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy 325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy 385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy 445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy 505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy 565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy 625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy 685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy 745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 894 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 953

Qy 805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy  865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
        ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy  925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy  985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1134 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1193

Qy 1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||
Db 1194 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1229



RESULT 7 
US-10-119-428-35
Sequence 35, Application US/10119428
Publication No. US20030165881A1
GENERAL INFORMATION:



APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Xu, Chongjun 
APPLICANT: Wehrman, Tom 
APPLICANT: Ren, Feiyan 
APPLICANT: Ma, Yunqing
APPLICANT: Zhou, Ping 

APPLICANT: Zhao, Qing A.
APPLICANT: Yang, Yonghong 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION: Novel Nucleic Acids and
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 789CIP2 

CURRENT APPLICATION NUMBER: US/10/119,428
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: 09/596,193 
PRIOR FILING DATE: 2000-06-17 
PRIOR APPLICATION NUMBER: 09/574,454 
PRIOR FILING DATE: 2000-05-19 
PRIOR APPLICATION NUMBER: 09/519,705 
PRIOR FILING DATE: 2000-03-07 
NUMBER OF SEQ ID NOS: 55 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 35 
LENGTH: 1550 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 
NAME/KEY: CDS
LOCATION: (129)..(1220)
US-10-119-428-35 



Query Match 97.6%; Score 1054.4; DB 7; Length 1550;
Best Local Similarity 99.9%; Pred. No. 0;
Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy  25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 216 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 275

Qy  85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 276 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 335

Qy 145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 336 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 395

Qy 205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 396 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 455

Qy 265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 456 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 515

Qy 325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 516 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 575

Qy 385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 576 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 635

Qy 445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 636 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 695

Qy 505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 696 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 755

Qy 565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 756 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 815

Qy 625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 816 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 875

Qy 685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 876 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 935

Qy 745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 936 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 995

Qy 805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 996 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1055

Qy  865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
        ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 1056 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1115

Qy  925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1116 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1175

Qy  985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1176 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1235

Qy 1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||
Db 1236 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1271



RESULT 8 

US-09-814-353-20113 

; Sequence 20113, Application US/09814353 
; Publication No. US20030165831A1
; GENERAL INFORMATION: 
; APPLICANT: Lee, John 

; APPLICANT: Thompson, Pamela
; APPLICANT: Lillie, James 

; TITLE OF INVENTION: NOVEL GENES, COMPOSITIONS, KITS, AND METHODS FOR 
; TITLE OF INVENTION: IDENTIFICATION, ASSESSMENT, PREVENTION, AND 
; TITLE OF INVENTION: THERAPY OF OVARIAN CANCER 
; FILE REFERENCE: MRI-006B 

; CURRENT APPLICATION NUMBER: US/09/814,353

; CURRENT FILING DATE: 2001-03-21

; PRIOR APPLICATION NUMBER: US 60/191,031 

; PRIOR FILING DATE: 2000-03-21 

; PRIOR APPLICATION NUMBER: US 60/207,124 

; PRIOR FILING DATE: 2000-05-25 

; PRIOR APPLICATION NUMBER: US 60/211,940 

; PRIOR FILING DATE: 2000-06-15 

; PRIOR APPLICATION NUMBER: US 60/216,820 

; PRIOR FILING DATE: 2000-07-07 

; PRIOR APPLICATION NUMBER: US 60/220,661 

; PRIOR FILING DATE: 2000-07-25 

; PRIOR APPLICATION NUMBER: US 60/257,672 

; PRIOR FILING DATE: 2000-12-21 

; NUMBER OF SEQ ID NOS: 22037

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 20113 

LENGTH: 3095 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature
LOCATION: 1, 2, 3, 3094, 3095 
OTHER INFORMATION: n = A,T,C or G 
US-09-814-353-20113 

Query Match 97.6%; Score 1054.4; DB 3; Length 3095; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



Qy  25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 109 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 168

Qy  85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 169 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 228

Qy 145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 229 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 288

Qy 205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 289 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 348

Qy 265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 349 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 408

Qy 325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 409 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 468

Qy 385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 469 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 528

Qy 445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 529 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 588

Qy 505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 589 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 648

Qy 565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 649 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 708

Qy 625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 709 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 768

Qy 685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 769 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 828

Qy 745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 829 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 888

Qy 805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 889 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 948

Qy 865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
       ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db 949 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1008

Qy  925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1009 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1068

Qy  985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1069 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1128

Qy 1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||
Db 1129 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1164




RESULT 9
US-10-729-895-3
; Sequence 3, Application US/10729895
; Publication No. US20060063156A1
; GENERAL INFORMATION:
; APPLICANT: UNIVERSITY OF NEW MEXICO



SCORE 1.3 BuildDate: 11/17/2006

SCORE Search Results Details for Application 10729895 and Search Result 20070109_162351_us-10-729-895a-1.rng.



GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model

Run on: January 11, 2007, 01:00:29 ; Search time 781 Seconds
(without alignments)
9641.522 Million cell updates/sec

Title: US-10-729-895A-1
Perfect score: 1080
Sequence: 1 atgcctttccttttgggtct aaagcaaccagggtagggga 1080



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 5244920 seqs, 3486124231 residues 
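
The throughput figure in this header can be reproduced from the other header values. This is a sketch under one assumption that the report does not state explicitly: that a "cell update" is one query-residue by database-residue comparison and that both strands of every database sequence are scanned. The hit total of 10489840, which is exactly twice the 5244920 sequences searched, is consistent with that assumption.

```python
# Values copied from the search header.
query_length = 1080            # "Perfect score: 1080" for a 1080-nt query
db_residues = 3_486_124_231    # "Searched: ... 3486124231 residues"
db_sequences = 5_244_920       # "Searched: 5244920 seqs"
total_hits = 10_489_840        # "Total number of hits satisfying chosen parameters"
search_seconds = 781           # "Search time 781 Seconds"

# Assumption: both strands of each database sequence are searched.
strands = 2

cell_updates = query_length * db_residues * strands
rate_millions = cell_updates / search_seconds / 1e6

print(f"{rate_millions:.3f} Million cell updates/sec")
print("hits per sequence:", total_hits / db_sequences)
```

Under these assumptions the computed rate agrees with the 9641.522 Million cell updates/sec printed in the header.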

Total number of hits satisfying chosen parameters: 10489840

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : N_Geneseq_8:*

1: geneseqn1980s:*
2 3 4 5 6 7 8 9 10 11 12 13 14 15

ALIGNMENTS



RESULT 1 
ADP48574 

ID ADP48574 standard; DNA; 1080 BP. 
XX 

AC ADP48574;
XX 

DT 09-SEP-2004 (first entry) 
XX 

DE Human OPAL1/G0 DNA splice form incorporating exon 1 SeqID 1. 
XX 

KW human; gene; ds; OPAL1; G0; childhood leukaemia; G protein beta 2; G1;

KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1;

KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia; 

KW AML. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1..1029

FT /*tag= a

FT /product= "OPAL1/G0 protein (incorporating exon 1)"

XX 

PN WO2004053074-A2. 
XX 

PD 24-JUN-2004. 
XX 

PF 05-DEC-2003; 2003WO-US038738.
XX

PR 06-DEC-2002; 2002US-0432064P.

PR 06-DEC-2002; 2002US-0432077P.

PR 06-DEC-2002; 2002US-0432078P.

PR 14-OCT-2003; 2003US-0510904P.

PR 14-OCT-2003; 2003US-0510968P.

PR 05-DEC-2003; 2003US-00729895.

PR 05-DEC-2003; 2003US-0527610P.
XX 

PA (SCTE-) SCI & TECHNOLOGY CORP @UNM. 

PA (SAND-) SANDIA CORP.

XX 

PI Willman CL, Helman P, Veroff R, Mosquera-Caro M, Davidson GS; 

PI Martin SB, Atlas SR, Andries E, Kang H, Shuster JJ, Haaland DM; 

PI Potter JW, Wang X, Harvey R; 
XX 

DR WPI; 2004-468846/44. 

DR P-PSDB; ADP48575. 
XX 

PT New isolated Outcome Predictor in Acute Leukemia 1 (OPAL1) 

PT polynucleotides and polypeptides for preventing, treating or diagnosing 

PT leukemia, or for screening or evaluating compounds that may be used for 

PT treating leukemia. 

XX 

PS Claim 1; SEQ ID NO 1; 343pp; English.
XX

CC This invention relates to a novel isolated nucleic acid identified as 

CC OPAL1 (also known as G0), and the encoded protein thereof. Specifically,

CC it refers to a method for predicting the outcome and risk classification 

CC of childhood leukaemia. The present invention describes a gene profiling 

CC method that can be used to determine the expression level for OPAL1, a 

CC gene that is strongly predictive regarding the outcome of both acute 

CC lymphoblastic leukaemia (ALL) and acute myeloid leukaemia (AML).

CC Furthermore, it provides additional genes namely G protein beta 2

CC (referred to as G1), interleukin-10 (IL-10) receptor alpha (referred to

CC as G2), the FYN-binding protein and PBK1 that can also be used, alone or 

CC in combination, in predictive assays for the classification and potential 

CC therapy for a leukaemia patient. Accordingly, these compositions exhibit 

CC cytostatic activities. This polynucleotide sequence is the human OPAL1/G0

CC splice form incorporating exon 1 of the invention.
XX 

SQ Sequence 1080 BP; 249 A; 341 C; 292 G; 198 T; 0 U; 0 Other;
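
The SQ line is self-checking: the per-base counts should add up to the stated length. A minimal sketch, reading the A count as 249 and using regular expressions that are only an illustration rather than an official Geneseq parser:

```python
import re

# SQ line from the record, with the A count read as 249.
sq_line = "SQ Sequence 1080 BP; 249 A; 341 C; 292 G; 198 T; 0 U; 0 Other;"

# Declared sequence length in base pairs.
length = int(re.search(r"Sequence (\d+) BP", sq_line).group(1))

# Per-base counts, keyed by the label that follows each number.
counts = {base: int(n)
          for n, base in re.findall(r"(\d+) (A|C|G|T|U|Other);", sq_line)}

total = sum(counts.values())
gc_percent = 100 * (counts["G"] + counts["C"]) / length

print("declared length:", length)
print("sum of counts:  ", total)
print(f"GC content:      {gc_percent:.1f}%")
```

The counts sum to 1080, which matches the declared length and supports reading the A count as 249.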

Query Match 100.0%; Score 1080; DB 12; Length 1080; 
Best Local Similarity 100.0%; Pred. No. 2.9e-278; 

Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60

Qy   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120

Qy  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180

Qy  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240

Qy  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300

Qy  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360

Qy  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420

Qy  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480

Qy  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540

Qy  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600

Qy  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660

Qy  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720

Qy  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780

Qy  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840

Qy  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900

Qy  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960

Qy  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020

Qy 1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080



RESULT 2 
AAI72319 

ID AAI72319 standard; cDNA; 2976 BP. 
XX 

AC AAI72319; 
XX 

DT 15-APR-2002 (first entry) 
XX 

DE ISIGP-2 cDNA. 
XX 

KW Human; intracellular signalling protein; ISIGP; gene; cell proliferation; 
KW autoimmune; inflammation; gastrointestinal disorder; 
KW reproductive disorder; developmental disorder; ss.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 479..1507

FT /*tag= a 

FT /product= "ISIGP-2" 
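The CDS feature above gives 1-based, inclusive coordinates on the database entry. A minimal Python sketch of the coordinate arithmetic follows; the helper names `parse_location` and `query_to_db` are hypothetical and not part of the report, and the offset assumes an ungapped alignment whose first query base pairs with the first base of the CDS, as the alignment reported for this hit does (Qy 1 against Db 479).

```python
def parse_location(location: str) -> tuple[int, int]:
    """Split a simple 'start..end' feature location into integers."""
    start, end = location.split("..")
    return int(start), int(end)


cds_start, cds_end = parse_location("479..1507")

# Coordinates are inclusive, so the length is end - start + 1.
cds_length = cds_end - cds_start + 1
codons = cds_length // 3  # this count includes the stop codon

# Assumption: ungapped alignment starting at the CDS start, so database
# position = query position + (cds_start - 1).
offset = cds_start - 1


def query_to_db(query_pos: int) -> int:
    """Map a 1-based query position to the database coordinate."""
    return query_pos + offset


print(cds_length, codons, query_to_db(1), query_to_db(1080))
```

Under these assumptions the 1080-base query spans database positions 479 to 1558, which agrees with the Db coordinates printed in the alignment for this result.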

XX 

PN WO200194391-A2. 
XX 

PD 13-DEC-2001. 
XX 

PF 07-JUN-2001; 2001WO-US018595.
XX 

PR 08-JUN-2000; 2000US-0210582P.
PR 16-JUN-2000; 2000US-0212443P.
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 
PI Yue H, He A, Nguyen DB, Yao MG, Bandman O, Burford N, Tang YT;

PI Xu Y, Hafalia A, Azimzai Y, Walia NK; 

XX 

DR WPI; 2002-154564/20. 

DR P-PSDB; AAB47872. 
XX 

PT New human intracellular signaling protein and polynucleotides useful for 

PT diagnosing, treating or preventing cell proliferative, 

PT autoimmune/inflammatory, gastrointestinal, reproductive and developmental

PT disorders. 

XX 

PS Claim 5; Page 102-03; 106pp; English. 
XX 

CC The sequences given in AAI72318-22 encode novel human intracellular 

CC signalling proteins (ISIGP). The polynucleotides and ISIGP proteins may 

CC be used for the diagnosis, treatment or prevention of cell proliferative, 

CC autoimmune/inflammatory, gastrointestinal, reproductive and developmental 

CC disorders. The protein encoded by this sequence has homology to human WW

CC domain binding protein-1 

XX 

SQ Sequence 2976 BP; 633 A; 817 C; 823 G; 703 T; 0 U; 0 Other; 
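Because scanned listings like this one often split or merge digits, the SQ line offers a built-in consistency check: the per-base counts should add up to the declared length. The following is a small Python sketch of that check; the function name `parse_sq_line` is an illustrative assumption, not something defined by the report or by the database format.

```python
import re


def parse_sq_line(line: str) -> tuple[int, dict[str, int]]:
    """Return (declared length, base counts) from an SQ summary line."""
    length = int(re.search(r"Sequence\s+(\d+)\s+BP", line).group(1))
    counts = {
        name: int(number)
        for number, name in re.findall(r"(\d+)\s+(A|C|G|T|U|Other);", line)
    }
    return length, counts


sq = "SQ Sequence 2976 BP; 633 A; 817 C; 823 G; 703 T; 0 U; 0 Other;"
length, counts = parse_sq_line(sq)

# The composition is self-consistent when the counts sum to the length.
composition_ok = sum(counts.values()) == length
print(length, counts, composition_ok)
```

A failed check points to a damaged count on that line rather than to a problem with the sequence itself.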

Query Match 100.0%; Score 1080; DB 6; Length 2976; 

Best Local Similarity 100.0%; Pred. No. 4.4e-278; 

Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  479 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 538

Qy   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  599 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 658

Qy  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378

Qy  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1379 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1438

Qy  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1439 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1498

Qy 1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1499 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1558



RESULT 3 
ADP48589 



ID ADP48589 standard; cDNA; 4122 BP. 
XX 

AC ADP48589; 
XX 

DT 09-SEP-2004 (first entry)
XX 

DE Full length human OPAL1 cDNA SeqID 16. 
XX 

KW human; ss; OPAL1; GO; childhood leukaemia; G protein beta 2; G1;

KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1; 

KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia; 

KW AML. 

XX 

OS Homo sapiens. 



XX 

PN WO2004053074-A2. 
XX 

PD 24-JUN-2004.
XX 

PF 05-DEC-2003; 2003WO-US038738.
XX 

PR 06-DEC-2002; 2002US-0432064P.

PR 06-DEC-2002; 2002US-0432077P.

PR 06-DEC-2002; 2002US-0432078P.

PR 14-OCT-2003; 2003US-0510904P.

PR 14-OCT-2003; 2003US-0510968P.

PR 05-DEC-2003; 2003US-00729895.

PR 05-DEC-2003; 2003US-0527610P.
XX 

PA (SCTE-) SCI & TECHNOLOGY CORP @UNM. 

PA (SAND-) SANDIA CORP. 

XX 

PI Willman CL, Helman P, Veroff R, Mosquera-Caro M, Davidson GS;

PI Martin SB, Atlas SR, Andries E, Kang H, Shuster JJ, Haaland DM; 

PI Potter JW, Wang X, Harvey R;
XX 

DR WPI; 2004-468846/44. 
XX 

PT New isolated Outcome Predictor in Acute Leukemia 1 (OPAL1) 

PT polynucleotides and polypeptides for preventing, treating or diagnosing 

PT leukemia, or for screening or evaluating compounds that may be used for 

PT treating leukemia. 

XX 



PS Disclosure; SEQ ID NO 16; 343pp; English. 
XX 

CC This invention relates to a novel isolated nucleic acid identified as 

CC OPAL1 (also known as GO), and the encoded protein thereof. Specifically, 

CC it refers to a method for predicting the outcome and risk classification 

CC of childhood leukaemia. The present invention describes a gene profiling 

CC method that can be used to determine the expression level for OPAL1, a 

CC gene that is strongly predictive regarding the outcome of both acute

CC lymphoblastic leukaemia (ALL) and acute myeloid leukaemia (AML).

CC Furthermore, it provides additional genes namely G protein beta 2

CC (referred to as G1), interleukin-10 (IL-10) receptor alpha (referred to

CC as G2), the FYN-binding protein and PBK1 that can also be used, alone or 



CC in combination, in predictive assays for the classification and potential 

CC therapy for a leukaemia patient. Accordingly, these compositions exhibit 

CC cytostatic activities. This polynucleotide sequence is the full length 

CC human OPAL1 cDNA of the invention.
XX 

SQ Sequence 4122 BP; 964 A; 1066 C; 1075 G; 1017 T; 0 U; 0 Other;

Query Match 100.0%; Score 1080; DB 12; Length 4122; 

Best Local Similarity 100.0%; Pred. No. 5.1e-278; 

Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1045 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1104

http://es/ScoreAccessWeb/GetItem.action?AppId=10729895&seqId=1038735&ItemName... 1/22/2007

Qy  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy 1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 4 
ADA53255 

ID ADA53255 standard; cDNA; 2703 BP. 
XX 

AC ADA53255; 
XX 

DT 20-NOV-2003 (first entry) 
XX 

DE Human coding sequence, SEQ ID 823. 
XX 



KW Cytostatic; Anti-inflammatory; Osteopathic; Neuroprotective; Nootropic; 

KW Gene Therapy; human; secretory protein; membrane proteins; cancer; 

KW inflammatory disease; osteoporosis; neurological disease; gene; ss. 
XX 

OS Homo sapiens. 
XX 

PN EP1293569-A2. 
XX 

PD 19-MAR-2003. 
XX 

PF 21-MAR-2002; 2002EP-00006586.
XX 

PR 14-SEP-2001; 2001JP-00328381.

PR 24-JAN-2002; 2002US-0350435P.
XX 

PA (HELI-) HELIX RES INST. 

PA (REAS-) RES ASSOC BIOTECHNOLOGY. 

XX 

PI Isogai T, Sugiyama T, Otsuki T, Wakamatsu A, Sato H, Ishii S; 

PI Yamamoto J, Isono Y, Hio Y, Otsuka K, Nagai K, Irie R, Tamechika I; 



PI Seki N, Yoshikawa T, Otsuka M, Nagahari K, Masuho Y; 
XX 

DR WPI; 2003-395539/38. 

DR P-PSDB; ADA54894.
XX 

PT New polynucleotides encoding full-length polypeptides, e.g. secretory 

PT and/or membrane proteins, useful for developing medicines for diseases in 

PT which the gene is involved, or as target molecules for gene therapy. 

XX 

PS Claim 1; SEQ ID NO 823; 205pp; English. 
XX 

CC The present invention relates to novel human secretory or membrane 

CC proteins (ADA54072-ADA55710) and their coding sequences (ADA52433-

CC ADA54071). The coding sequences are useful in the gene therapy of

CC diseases caused by abnormalities of the proteins, e.g. cancer, 

CC inflammatory diseases, osteoporosis or neurological disease. 
XX 

SQ Sequence 2703 BP; 599 A; 742 C; 713 G; 649 T; 0 U; 0 Other;



Query Match 99.7%; Score 1076.8; DB 10; Length 2703; 

Best Local Similarity 99.8%; Pred. No. 3.1e-277; 

Matches 1078; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
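The summary figures for this hit appear to be related by simple ratios. The Python sketch below reproduces them under two assumptions that the report itself does not state: that "Query Match" is the hit score divided by the score of a perfect hit (1080, the score reported here for the exact matches to this 1080-base query), and that "Best Local Similarity" is the number of matches divided by the number of aligned columns. The helper name `percent` is illustrative only.

```python
def percent(numerator: float, denominator: float) -> float:
    """Ratio expressed as a percentage, rounded to one decimal place."""
    return round(100.0 * numerator / denominator, 1)


PERFECT_SCORE = 1080  # assumed: score of an exact, full-length hit

# Figures reported for this hit.
score = 1076.8
matches, conservative, mismatches, indels = 1078, 0, 2, 0

# Assumed definition: every aligned column falls in one of these classes.
aligned_columns = matches + conservative + mismatches + indels

query_match = percent(score, PERFECT_SCORE)
local_similarity = percent(matches, aligned_columns)
print(query_match, local_similarity)
```

Both values agree with the percentages printed for this result, which supports the assumed definitions without confirming them.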



Qy    1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy   61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy  121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy  181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy  241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy  301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy  361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy  421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy  481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy  541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy  601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy  661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy  721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy  781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy  841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy  901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
        ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db 1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy  961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy 1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 5 
AAS44951 



ID AAS44951 standard; cDNA; 1508 BP. 
XX 

AC AAS44951; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE cDNA encoding novel human secretory protein, Seq ID No 32. 
XX 

KW Human; secreted protein; arthritis; Crohn's disease; sepsis; shock; 

KW ischaemia-reperfusion injury; haematopoiesis; cancer; neuropathy;

KW transgenic animal; Alzheimer's disease; Parkinson's disease; burn; 

KW amyotrophic lateral sclerosis; platelet disorder; thrombocytopenia; 
KW ulcer; osteoporosis; bone degenerative disorder; periodontal disease;

KW gut protection; lung; liver fibrosis; immune deficiency; infection; 

KW severe combined immunodeficiency; SCID; autoimmune disorder; allergy; 

KW multiple sclerosis; rheumatoid arthritis; diabetes mellitus; asthma; 

KW fertility; analgesic; pain; antigen; ss.
XX 

OS Homo sapiens. 
XX 

PN WO200166689-A2.
XX 

PD 13-SEP-2001. 
XX 

PF 05-MAR-2001; 2001WO-US004942.
XX 

PR 07-MAR-2000; 2000US-00519705.

PR 19-MAY-2000; 2000US-00574454.

PR 17-JUN-2000; 2000US-00596193.

PR 14-JUL-2000; 2000US-00616847.

PR 19-SEP-2000; 2000US-00665363.

PR 20-OCT-2000; 2000US-00693267.
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Xu C, Wehrman T, Ren F, Ma Y, Zhou P;

PI Zhao QA, Yang Y, Drmanac RT, Zhang J, Chen R, Xue AJ, Wang J; 
XX 

DR WPI; 2001-589934/66. 

DR P-PSDB; AAU28051.
XX 

PT Novel polypeptides and nucleic acids obtained from cDNA libraries 

PT prepared from various human tissues, for diagnosis and treatment of 

PT cancer, neurological, inflammatory, and autoimmune disorders.
XX 

PS Claim 1; SEQ ID NO 32; 107pp; English. 
XX 

CC The invention relates to novel isolated human secreted polypeptides (I)

CC and polynucleotides (II). (I) and (II) are useful for treating 

CC inflammatory conditions such as arthritis, nephritis, Crohn's disease, 

CC ischaemia-reperfusion injury, shock, sepsis, immune responses, and is

CC involved in increasing haematopoiesis, stem cell survival, bone growth

CC and remodeling. (I), (II) and modulators of (II) are useful for 

CC prophylaxis or treatment of one or more cancers. (II) is also useful for 

CC creating transgenic animals useful for studying the in vivo activities of 

CC the polypeptide as well as for studying modulators of the polypeptides. 

CC (I) induces the proliferation of neural cells and regeneration of nerve 

CC and brain tissue and is useful for the treatment of central and 

CC peripheral nervous system diseases and neuropathies, such as Alzheimer's, 

CC Parkinson's disease, Huntington's disease, and amyotrophic lateral 

CC sclerosis. In addition, (I) is involved in chemotactic or chemokinetic 

CC activity, regulation of haematopoiesis and is useful for treating myeloid 

CC or lymphoid cell disorders, platelet disorders such as thrombocytopenia 

CC and for regeneration of bone, cartilage, tendon, ligament and/or nerve 

CC tissue growth, and in tissue repair, healing of burns, incisions, ulcers, 

CC for treating osteoporosis, osteoarthritis, bone degenerative disorders, 

CC or periodontal disease. Furthermore, (I) is also useful for gut 

CC protection or regeneration and treatment of lung or liver fibrosis, 

CC reperfusion injury in various tissues, various immune deficiencies and 

CC disorders including severe combined immunodeficiency (SCID), bacterial or 

CC fungal infections, autoimmune disorders e.g. multiple sclerosis, 

CC rheumatoid arthritis, diabetes mellitus, myasthenia gravis, allergic 

CC reactions and conditions, such as asthma or other respiratory problems. 

CC In addition, (I) affects biorhythms or circadian cycles of rhythms, 

CC fertility, metabolism, catabolism, anabolism, storage or elimination of 

CC dietary fat, lipid, protein, carbohydrate, vitamins, minerals, provides 

CC analgesic effects or other pain reducing effects, immunoglobulin like 

CC activity and can act as an antigen in a vaccine composition to raise an 

CC immune response. AAS44920-AAS45295 represent novel human secreted protein 

CC coding sequences of the invention 

XX 

SQ Sequence 1508 BP; 345 A; 447 C; 423 G; 293 T; 0 U; 0 Other; 

Query Match 97.6%; Score 1054.4; DB 5; Length 1508; 

Best Local Similarity 99.9%; Pred. No. 2.4e-271; 

Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy     25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    174 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 233

Qy     85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    234 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 293

Qy    145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    294 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 353

Qy    205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    354 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 413

Qy    265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    414 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 473

Qy    325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    474 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 533

Qy    385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    534 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 593

Qy    445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    594 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 653

Qy    505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    654 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 713

Qy    565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    714 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 773

Qy    625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    774 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 833

Qy    685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    834 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 893

Qy    745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    894 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 953

Qy    805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    954 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 1013

Qy    865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
          ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db   1014 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1073

Qy    925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1074 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1133

Qy    985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1134 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1193

Qy   1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||
Db   1194 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1229



RESULT 6 
ADL61901 

ID ADL61901 standard; DNA; 3095 BP. 
XX 
AC ADL61901; 
XX 

DT 20-MAY-2004 (first entry) 
XX 

DE Human ovarian cancer DNA marker #20113. 
XX 

KW Human; ovarian cancer; ds; tumour; cytostatic; DNA marker. 
XX 

OS Homo sapiens. 
XX 

PN WO200170979-A2. 
XX 

PD 27-SEP-2001. 
XX 

PF 21-MAR-2001; 2001WO-US009126. 
XX 

PR 21-MAR-2000; 2000US-0191031P. 

PR 25-MAY-2000; 2000US-0207124P. 

PR 15-JUN-2000; 2000US-0211940P. 

PR 07-JUL-2000; 2000US-0216820P. 

PR 25-JUL-2000; 2000US-0220661P. 

PR 21-DEC-2000; 2000US-0257672P. 
XX 

PA (MILL-) MILLENNIUM PREDICTIVE MEDICINE INC. 
XX 

PI Lee J, Lillie J; 
XX 

DR WPI; 2001-611502/70. 
XX 

PT Novel isolated nucleic acid molecules (markers) overexpressed in ovarian 

PT cancer cells as compared to their normal non-cancerous ovarian cells are 

PT used to characterize stage, grade, histological type of ovarian cancer. 
XX 

PS Disclosure; SEQ ID NO 20113; 106pp; English. 
XX 

CC The invention relates to nucleic acid markers which are overexpressed in 

CC ovarian cancer cells as compared to their expression in normal (i.e. non- 

CC cancerous) ovarian cells. The invention also relates to polypeptides 

CC encoded by the markers, antibodies that selectively bind to the 

CC polypeptides, a method of inhibiting ovarian cancer in a patient at risk 

CC of developing ovarian cancer involving inhibiting expression of a gene 

CC corresponding to a marker of the invention and a method of treating a 

CC patient afflicted with ovarian cancer comprising providing to cells of 

CC the patient an antisense oligonucleotide complementary to a marker of the 

CC invention. The markers are useful for assessing if a patient is afflicted 

CC with ovarian cancer, which involves comparing the level of expression of 

CC a marker in a patient sample and a normal level of expression of the 

CC marker in a control non-ovarian cancer sample. A difference between the 

CC expression levels indicates ovarian cancer. The level of expression of a 

CC marker corresponds to a secreted protein or to a transcribed 

CC polynucleotide or its portion. The level of expression of the marker is 

CC assessed by detecting the presence in the sample, a protein or protein 

CC fragment corresponding to the marker. The presence of protein or protein 

CC fragment is detected using an antibody that specifically binds with the 

CC protein or protein fragment. Alternatively, the level of expression of 

CC the marker is assessed by detecting the presence of a transcribed 

CC polynucleotide which anneals with the marker or anneals with a portion of 

CC the polynucleotide comprising the marker, under stringent conditions. The 

CC marker is also used for monitoring the progression of ovarian cancer in a 

CC patient which involves detecting expression of the marker in a patient 

CC sample at a first point in time, repeating the method at a subsequent 

CC time and comparing the level of expression. The method is carried out 

CC using an ovarian tissue sample. A composition comprising a marker, 

CC polypeptide or antibody of the invention is used to treat ovarian cancer. 

CC This sequence represents a human ovarian cancer DNA marker of the 

CC invention. Note; The sequence data for this patent did not form part of 

CC the printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences. 

XX 

SQ Sequence 3095 BP; 702 A; 832 C; 845 G; 711 T; 0 U; 5 Other; 

Query Match 97.6%; Score 1054.4; DB 5; Length 3095; 

Best Local Similarity 99.9%; Pred. No. 3.2e-271; 

Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy     25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    109 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 168

Qy     85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    169 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 228

Qy    145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    229 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 288

Qy    205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    289 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 348

Qy    265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    349 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 408

Qy    325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    409 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 468

Qy    385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    469 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 528

Qy    445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    529 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 588

Qy    505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    589 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 648

Qy    565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    649 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 708

Qy    625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    709 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 768

Qy    685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    769 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 828

Qy    745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    829 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 888

Qy    805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    889 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 948

Qy    865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
          ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db    949 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1008

Qy    925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1009 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1068

Qy    985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1069 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1128

Qy   1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||
Db   1129 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1164



RESULT 7 
ADP48576 

ID ADP48576 standard; DNA; 1140 BP. 
XX 
AC ADP48576; 
XX 
DT 09-SEP-2004 (first entry) 
XX 
DE Human OPAL1/G0 DNA splice form incorporating exon 1a SeqID 3. 
XX 
KW human; gene; ds; OPAL1; G0; childhood leukaemia; G protein beta 2; G1; 
KW interleukin-10 receptor alpha; G2; IL-10; FYN-binding protein; PBK1; 
KW cytostatic; acute lymphoblastic leukaemia; ALL; acute myeloid leukaemia; 
KW AML. 
XX 
OS Homo sapiens. 
XX 
FH Key Location/Qualifiers 
FT CDS 1..1092 
FT /*tag= a 
FT /product= "OPAL1/G0 protein (incorporating exon 1a)" 
XX 
PN WO2004053074-A2. 
XX 
PD 24-JUN-2004. 
XX 
PF 05-DEC-2003; 2003WO-US038738. 
XX 
PR 06-DEC-2002; 2002US-0432064P. 
PR 06-DEC-2002; 2002US-0432077P. 
PR 06-DEC-2002; 2002US-0432078P. 
PR 14-OCT-2003; 2003US-0510904P. 
PR 14-OCT-2003; 2003US-0510968P. 
PR 05-DEC-2003; 2003US-00729895. 
PR 05-DEC-2003; 2003US-0527610P. 
XX 
SCORE Search Results Details for Application 10729895 and Search Result 20070109_162353_us-10-729-895a-1.rge. 




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: January 11, 2007, 01:00:29 ; Search time 6422 Seconds 
(without alignments) 
10754.160 Million cell updates/sec 

Title: US-10-729-895A-1 
Perfect score: 1080 
Sequence: 1 atgcctttccttttgggtct...aaagcaaccagggtagggga 1080 

Scoring table: IDENTITY_NUC 
Gapop 10.0 , Gapext 1.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying chosen parameters: 12732272 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : GenEmbl:* 
1: gb_env:* 
2: gb_pat:* 
3: gb_ph:* 
4: gb_pl:* 
5: gb_pr:* 
6: gb_ro:* 
7: gb_sts:* 
8: gb_sy:* 
9: gb_un:* 
10: gb_vi:* 
11: gb_ov:* 
12: gb_htg:* 
13: gb_in:* 
14: gb_om:* 
15: gb_ba:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
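
The percentage figures printed with each result (Query Match, Best Local Similarity) appear to follow simple ratios of the other numbers on the same summary lines. The sketch below is only an illustration: the two formulas are an assumption inferred from the figures in this report, not taken from GenCore documentation, and the function names are hypothetical.

```python
# Assumption: Query Match = hit score / perfect score, and
# Best Local Similarity = matches / aligned positions.
# Both relations are inferred from the numbers printed in this report.

def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score expressed as a percentage of the query's perfect score."""
    return 100.0 * score / perfect_score


def local_similarity_percent(matches: int, conservative: int,
                             mismatches: int, indels: int) -> float:
    """Identical positions as a percentage of all aligned positions."""
    aligned = matches + conservative + mismatches + indels
    return 100.0 * matches / aligned


if __name__ == "__main__":
    perfect = 1080  # "Perfect score" from the search header above
    # A hit with Score 1076.8, Matches 1078, Mismatches 2
    print(round(query_match_percent(1076.8, perfect), 1))
    print(round(local_similarity_percent(1078, 0, 2, 0), 1))
```

Under these assumptions, a hit that matches the query at every position has a score equal to the perfect score, so both percentages come out at 100.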
ALIGNMENTS 



RESULT 1 
AX337966 
LOCUS AX337966 2976 bp DNA linear PAT 09-JAN-2002 
DEFINITION Sequence 7 from Patent WO0194391. 
ACCESSION AX337966 
VERSION AX337966.1 GI:18128677 
KEYWORDS 
SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
REFERENCE 1 
AUTHORS Yue,H., He,A., Nguyen,D.B., Yao,M.G., Bandman,O., Burford,N., 
Tang,Y.T., Xu,Y., Hafalia,A., Azimzai,Y. and Walia,N.K. 
TITLE Intracellular signaling proteins 
JOURNAL Patent: WO 0194391-A 7 13-DEC-2001; 
Incyte Genomics, Inc. (US) 
FEATURES Location/Qualifiers 
source 1..2976 
/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 
/note="Incyte ID No: 1478005CB1" 
ORIGIN 

Query Match 100.0%; Score 1080; DB 2; Length 2976; 

Best Local Similarity 100.0%; Pred. No. 1.4e-310; 

Matches 1080; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    479 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 538

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    539 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 598

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    599 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 658

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    659 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 718

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    719 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 778

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    779 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 838

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    839 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 898

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    899 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 958

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    959 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 1018

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1019 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 1078

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1079 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 1138

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1139 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 1198

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1199 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 1258

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1259 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 1318

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1319 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1378

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1379 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 1438

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1439 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1498

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1499 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1558

RESULT 2 
AX714139 

LOCUS AX714139 2703 bp DNA linear PAT 15-APR-2003 

DEFINITION Sequence 823 from Patent EP1293569. 
ACCESSION AX714139 

VERSION AX714139.1 GI:29889067 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 

Hominidae; Homo. 
REFERENCE 1 

AUTHORS Isogai,T., Sugiyama,T., Otsuki,T., Wakamatsu,A., Sato,H., Ishii,S., 
Yamamoto,J.I., Isono,Y., Hio,Y., Otsuka,K., Nagai,K., Irie,R., 
Tamechika,I., Seki,N., Yoshikawa,T., Otsuka,M., Nagahari,K. and 
Masuho,Y. 

TITLE Full-length cDNAs 

JOURNAL Patent: EP 1293569-A 823 19-MAR-2003; 

Helix Research Institute (JP) ; Research Association for 
Biotechnology (JP) 
FEATURES Location/Qualifiers 
source 1. .2703 

/organism="Homo sapiens" 
/mol_type="unassigned DNA" 
/db_xref="taxon:9606" 

ORIGIN 

Query Match 99.7%; Score 1076.8; DB 2; Length 2703; 

Best Local Similarity 99.8%; Pred. No. 1.3e-309; 

Matches 1078; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    145 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 204

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    205 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 264

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    265 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 324

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    325 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    385 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 444

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 564

Qy    421 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy    481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy    541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy    601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy    661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy    721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy    781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy    841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy    901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960

Db   1045 [...] 1104

Qy    961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy   1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 3 
AK056285 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
PUBMED 



AK056285 2703 bp mRNA linear PRI 20-JAN-2006 

Homo sapiens cDNA FLJ31723 fis, clone NT2RI2006682 , weakly similar 
to Human WW domain binding protein-1 mRNA. 
AK056285 

AK056285.1 GI:16551642 

oligo capping; fis (full insert sequence). 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebra ta; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 
1 

Ota,T., Suzuki, Y., Nishikawa,T . , Otsuki,T., Sugiyama,T., Irie,R., 
Wakamatsu, A. , Hayashi,K., Sato,H., Nagai,K., Kimura,K., Makita,H., 
Sekine,M., Obayashi,M., Nishi,T., Shibahara, T . , Tanaka,T., 
Ishii,S., Yamamoto,J., Saito,K., Kawai,Y., Isono,Y., Nakamura,Y., 
Nagahari,K., Murakami, K. , Yasuda,T., Iwayanagi,T . , Wagatsuma,.M. , 
Shiratori, A. , Sudo,H., Hosoiri,T., Kaku,Y., Kodaira,H.-, Kondo,H., 
Sugawara,M., Takahashi,M. , Kanda,K., Yokoi,T., Furuya,T., 
Kikkawa,E., Omura,Y., Abe,K., Kamihara,K., Katsuta,N., Sato,K., 
Tanikawa,M., Yamazaki,M., Ninomiya,K., Ishibashi , T . , Yamashita , H . , 
Mu ra kawa , K . , Fujimori, K., Tanai,H., Kimata,M., Watanabe,M., 
Hiraoka,S., Chiba,Y., Ishida,S., Ono,Y., Takiguchi , S . , Watanabe,S., 
Yosida,M., Hotuta,T., Kusano,J., Kanehori, K. , Takahashi-Fu jii, A. , 
Hara,H., Tanase,T.O., Nomura, Y., Togiya,S., Komai,F., Hara,R., 
Takeuchi, K. , Arita,M., Imose,N., Musashino, K. , Yuuki,H., Oshima,A., 
Sasaki, N., Aotsuka,S., Yoshikawa , Y . , Matsunawa , H . , Ichihara,T., 
Shiohata,N., Sano,S., Moriya,S., Momiyama,H., Satoh,N., Takami,S., 
Terashima, Y. , Suzuki, O., Nakagawa,S., Senoh,A., Mizoguchi, H . , 
Goto,Y., Shimizu,F., Wakebe,H., Hishigaki, H . , Watanabe,T., 
Sugiyama,A., Takemoto,M., Kawakami,B., Yamazaki,M. , Watanabe,K., 
Kumagai,A., Itakura,S., Fukuzumi,Y., Fujimori, Y., Komiyama,M., 
Tashiro,H., Tanigami,A., Fujiwara,T., Ono,T., Yamada,K., Fujii,Y., 
Ozaki,K., Hirao,M., Ohmori,Y., Kawabata, A. , Hikiji,T., Kobatake,N., 
Inagaki,H., Ikema,Y., Okamoto,S., Okitani,R., Kawakami,T., 
Noguchi,S., Itoh,T., Shigeta,K., Senba,T., Matsumura , K . , 
Nakajima,Y., Mizuno,T., Morinaga,M., Sasaki, M., Togashi,T., 
Oyama,M., Hata,H., Watanabe,M., Komatsu,T., Mizushima-Sugano, J. , 
Satoh,T., Shirai,Y., Takahashi , Y . , Nakagawa,K., Okumura,K., 
Nagase,T., Nomura, N . , Kikuchi,H., Masuho,Y., Yamashita , R. , 
Nakai,K., Yada,T., Nakamura,Y., Ohara,0., Isogai,T. and Sugano,S. 
Complete sequencing and characterization of 21,243 full-length 
human cDNAs 

Nat. Genet. 36 (1), 40-45 (2004) 
14702039 
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REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



Ishibashi,T., Kanehori,K., Yosida,M., Watanabe,S., Ishida,S.,
Ono,Y., Hotuta,T., Hiraoka,S., Murakawa,K., Takiguchi,S.,



FEATURES 

source 



Fujimori,K., Tanai,H., Ishida,M.,
Sugiyama,T., Irie,R., Otsuki,T., Sato,H.,
Yamamoto,J., Isono,Y., Kawai-Hio,Y.,
Kimura,K., Matsuo,K., Nakamura,Y.,
Kanda,K., Wagatsuma,M., Takahashi-Fujii,A.,
Kawakami,B., Suzuki,Y., Sugano,S.,



Kusano,J., Watanabe,M.,
Yamashita,H., Chiba,Y.,
Wakamatsu,A., Ishii,S.,
Saito,K., Nishikawa,T.,
Sekine,M., Kikuchi,H.,
Oshima,A., Sugiyama,A.,
Nagahari,K., Masuho,Y., Nagai,K. and Isogai,T.
NEDO human cDNA sequencing project 
Unpublished 
3 (bases 1 to 2703) 
Isogai,T., Otsuki,T. and Sugiyama,T. 
Direct Submission 

Submitted (24-OCT-2001) Takao Isogai, Helix Research Institute,
Genomics Laboratory; 1532-3 Yana, Kisarazu, Chiba 292-0812, Japan
(E-mail:flj-cdna@nifty.com, Tel:81-438-52-3975, Fax:81-438-52-3986)
NEDO human cDNA sequencing project supported by Ministry of
Economy, Trade and Industry of Japan; cDNA full insert sequencing:
Research Association for Biotechnology (RAB); cDNA library
construction: Helix Research Institute (HRI) (supported by Japan
Key Technology Center etc.); 5'- & 3'-end one pass sequencing: RAB,
HRI, and Biotechnology Center, National Institute of Technology and 
Evaluation; clone selection for full insert sequencing: RAB and 
HRI. 

Location/Qualifiers
1..2703

/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="NT2RI2006682"
/cell_line="NT2"
/cell_type="teratocarcinoma"
/clone_lib="NT2RI2"
/note="cloning vector: pME18SFL3

mRNA from NT2 neuronal precursor cells treated 2-weeks
mitotic inhibitor after 5-weeks retinoic acid (RA)
induction.
majorly NT2 neuron"



ORIGIN 



Query Match 99.7%; Score 1076.8; DB 5; Length 2703;
Best Local Similarity 99.8%; Pred. No. 1.3e-309;
Matches 1078; Conservative 0; Mismatches 2; Indels 0; Gaps 0;


Qy 


i 


Db 


145 


Qy 


61 


Db 


205 


Qy 


121 


Db 


265 


Qy 


181 


Db 


325 


Qy 


241 


Db 


385 


Qy 


301 


Db 


445 


Qy 


361 


Db 


505 


Qy 


421 



AT GC CT TT CCTTTT GGGT CTT AGACAGGAT AAGGAAGCCT GT GT GGGT AC CAAC AATC AA 6 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



AGCT ACAT CT GT GACACAGGACACTGCT GT GGAC AGT CTC AGTGCT GCAACT ACTACT AT 
I t I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



120 



264 



GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 
I I t i I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I 



TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 
I I I I I I I I I II I I I I II I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I 



180 



324 



240 



TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 384 

CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300 

I I I I I I I I I I I I I I I t I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

CT GAT CGC T T AC CG AG AAGC C C AC AATT AC TC AGC GCT GC C ATT TT AT TT C AGGTT TT T G 4 44 

CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 3 60 

I I 1 1 I I I I I I I I I I I I I I I I I I I M I I I I I I I 1 1 I I I I I I I I I I I I I I I I 1 1 I I I I I M I 
CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 504 

CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 4 20 
I I I I I I I I I I I I I I I I I I I I I I j I I 1 I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 1 I I I I 
CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCA 5 64 



I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
Db   565 GGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCC 624

Qy   481 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 540
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   625 TTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCT 684

Qy   541 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 600
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   685 GACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTG 744

Qy   601 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 660
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   745 GCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAG 804

Qy   661 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 720
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   805 GAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCT 864

Qy   721 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 780
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   865 GGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGC 924

Qy   781 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 840
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   925 CACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCC 984

Qy   841 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 900
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   985 CTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGT 1044

Qy   901 CAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCA 960
         ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||
Db  1045 CAGCCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCTCA 1104

Qy   961 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1020
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1105 TGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCC 1164

Qy  1021 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1165 CCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1224



RESULT 4
CQ413042
LOCUS CQ413042 3095 bp DNA linear PAT 23-JAN-2004
DEFINITION Sequence 20113 from Patent WO0170979.
ACCESSION CQ413042
VERSION CQ413042.1 GI:41320823
KEYWORDS
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1
AUTHORS Lee,J. and Lillie,J.
TITLE Genes, compositions, kits, and method for identification,
assessment, prevention, and therapy of ovarian cancer
JOURNAL Patent: WO 0170979-A 20113 27-SEP-2001;
Millennium Pharmaceuticals, Inc. (US)
FEATURES Location/Qualifiers
source 1..3095
/organism="Homo sapiens"
/mol_type="unassigned DNA"
/db_xref="taxon:9606"
ORIGIN

Query Match 97.6%; Score 1054.4; DB 2; Length 3095;
Best Local Similarity 99.9%; Pred. No. 6.5e-303;
Matches 1055; Conservative 0; Mismatches 1; Indels 0; Gaps

Qy    25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   109 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 168

Qy    85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   169 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 228

Qy   145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   229 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 288

Qy   205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   289 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 348

Qy   265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 324
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   349 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTAT 408

Qy   325 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 384
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   409 GAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAG 468

Qy   385 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 444
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   469 CAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGAT 528

Qy   445 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 504
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   529 CCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGC 588

Qy   505 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 564
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   589 ACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCC 648

Qy   565 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 624
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   649 ACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCG 708

Qy   625 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 684
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   709 GGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCT 768

Qy   685 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 744
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   769 GAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGT 828

Qy   745 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 804
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   829 GACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAG 888

Qy   805 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 864
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   889 TTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCAT 948

Qy   865 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGA 924
         ||||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||
Db   949 GTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGCCCTCTGAGGAGCAGGCTCGA 1008

Qy   925 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 984
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1009 GAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAAC 1068

Qy   985 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1044
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1069 GAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAG 1128

Qy  1045 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
         ||||||||||||||||||||||||||||||||||||
Db  1129 CACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1164



RESULT 5
BD275013
LOCUS BD275013 4054 bp DNA linear PAT 17-JUL-2003
DEFINITION 50 Human Secreted Proteins.
ACCESSION BD275013
VERSION BD275013.1 GI:33084781
KEYWORDS JP 2002542766-A/13.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 4054)
AUTHORS Komatsoulis,G., Rosen,C.A. and Ruben,S.M.
TITLE 50 Human Secreted Proteins
JOURNAL Patent: JP 2002542766-A 13 17-DEC-2002;
Rosen et al
COMMENT OS Homo sapiens
PN JP 2002542766-A/13
PD 17-DEC-2002
PF 09-MAR-2000 JP 2000605624
PR 12-MAR-1999 US 60/124093, 23-NOV-1999 US 60/166989
PI george komatsoulis,craig a rosen,steven m ruben
CC
FH Key Location/Qualifiers.
FEATURES Location/Qualifiers
source 1..4054
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
ORIGIN

Query Match 96.6%; Score 1043.4; DB 2; Length 4054;
Best Local Similarity 99.8%; Pred. No. 1.4e-299;
Matches 1055; Conservative 0; Mismatches 1; Indels 1; Gaps 1;

Qy    25 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 84
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   107 CAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACAC 166

Qy    85 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 144
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   167 TGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTG 226

Qy   145 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 204
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   227 TGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCAC 286

Qy   205 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 264
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   287 CGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCAC 346

Qy   265 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCC-TTA 323
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||| |||
Db   347 AATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTTA 406

Qy   324 TGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACA 383
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   407 TGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACA 466

Qy   384 GCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA 443
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   467 GCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA 526

Qy   444 TCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAG 503
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   527 TCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAG 586

Qy   504 CACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGC 563
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   587 CACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGC 646

Qy   564 CACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCC 623
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   647 CACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCC 706

Qy   624 GGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTC 683
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   707 GGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTC 766

Qy   684 TGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGG 743
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   767 TGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGG 826

Qy   744 TGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGA 803
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   827 TGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGA 886

Qy   804 GTTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCA 863
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   887 GTTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCA 946

Qy   864 TGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCG 923
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   947 TGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCG 1006

Qy   924 AGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAA 983
         |||||||||||||||| |||||||||||||||||||||||||||||||||||||||||||
Db  1007 AGAGCCTGGGCACCCGGACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAA 1066

Qy   984 CGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCA 1043
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  1067 CGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCA 1126

Qy  1044 GCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1080
         |||||||||||||||||||||||||||||||||||||
Db  1127 GCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGGGA 1163
CQ720290 1002 bp DNA linear PAT 03-FEB-2004
Sequence 6224 from Patent WO02068579.
CQ720290
CQ720290.1 GI:42281147
Homo sapiens (human)
Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
1
Venter,C.J., Adams,M.C., Li,P.W. and Myers,E.W.
Kits, such as nucleic acid arrays, comprising a majority of
human exons or transcripts, for detecting expression and other uses
thereof
JOURNAL Patent: WO 02068579-A 6224 06-SEP-2002;
PE Corporation (NY) (US)
FEATURES Location/Qualifiers
source 1..1002
/organism="Homo sapiens"
/mol_type="unassigned DNA"
/db_xref="taxon:9606"
ORIGIN

Query Match 92.8%; Score 1002; DB 2; Length 1002;
Best Local Similarity 100.0%; Pred. No. 2e-287;
Matches 1002; Conservative 0; Mismatches 0; Indels 0; Gaps



Qy    28 GATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACACTGC 87
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     1 GATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACATCTGTGACACAGGACACTGC 60

Qy    88 TGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGG 147
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    61 TGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCTGGTGGTTCTGGCTGGTGTGG 120

Qy   148 ACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGC 207
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   121 ACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACCACCGCCGAGCCAAGCACCGC 180

Qy   208 CTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCACAAT 267
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   181 CTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCGCTTACCGAGAAGCCCACAAT 240

Qy   268 TACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTATGAG 327
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   241 TACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTATGAG 300

Qy   328 GAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAG 387
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   301 GAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAG 360

Qy   388 CAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCC 447
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   361 CAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCC 420

Qy   448 ACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACA 507
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   421 ACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACA 480

Qy   508 AGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACC 567
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   481 AGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACC 540

Qy   568 AAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGG 627
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   541 AAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGG 600

Qy   628 GCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAA 687
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   601 GCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAA 660

Qy   688 CACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGAC 747
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   661 CACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGAC 720

Qy   748 TCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTC 807
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   721 TCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTC 780

Qy   808 AACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTG 867
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   781 AACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTG 840

Qy   868 CGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAG 927
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   841 CGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAG 900

Qy   928 CCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAG 987
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   901 CCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAG 960

Qy   988 CAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAG 1029
         ||||||||||||||||||||||||||||||||||||||||||
Db   961 CAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCTAG 1002
AL358790 131753 bp DNA linear PRI 18-MAY-2005
Human DNA sequence from clone RP11-753C18 on chromosome 10 Contains
three novel genes, the CYP17A1 gene for cytochrome P450 17A1, a
profilin 1 (PFN1) pseudogene, the gene for methyltransferase cyt19,
an unactive progesterone receptor 23 kD (TEBP) pseudogene, a
ribosomal protein L22 (RPL22) pseudogene and a CpG island, complete
sequence.
AL358790
AL358790.22 GI:16972912
HTG; CpG island; CYP17A1; cyt19; methyltransferase; PFN1; profilin;
RPL22; TEBP.
Homo sapiens (human)
Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
1 (bases 1 to 131753)
Tromans,A.
Direct Submission
Submitted (13-MAY-2005) Wellcome Trust Sanger Institute, Hinxton,
Cambridgeshire, CB10 1SA, UK. E-mail enquiries: vega@sanger.ac.uk
Clone requests: clonerequest@sanger.ac.uk

On Nov 16, 2001 this sequence version replaced gi:15131808.
The following abbreviations are used to associate primary accession
numbers given in the feature table with their source databases:
Em:, EMBL; Sw:, SWISSPROT; Tr:, TREMBL; Wp:, WORMPEP; Information
on the WORMPEP database can be found at
http://www.sanger.ac.uk/Projects/C_elegans/wormpep This sequence
was generated from part of bacterial clone contigs of human
chromosome 10, constructed by the Sanger Centre Chromosome 10
Mapping Group. Further information can be found at
http://www.sanger.ac.uk/HGP/Chr10
RP11-753C18 is from the library RPCI-11.3 constructed by the group
of Pieter de Jong. For further details see
http://www.chori.org/bacpac/home.htm
VECTOR: pBACe3.6

: Genome Center 

Center: Wellcome Trust Sanger Institute 
Center code: SC 

Web site: http://www.sanger.ac.uk 
Contact: vega@sanger.ac.uk 



This sequence was finished as follows unless otherwise noted: all
regions were either double-stranded or sequenced with an alternate
chemistry or covered by high quality data (i.e., phred quality >=
30); an attempt was made to resolve all sequencing problems, such
as compressions and repeats; all regions were covered by at least
one subclone; and the assembly was confirmed by restriction digest,
except on the rare occasion of the clone being a YAC.
FEATURES Location/Qualifiers
source 1..131753
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
/chromosome="10"
/clone="RP11-753C18"
/clone_lib="RPCI-11.3"
misc_feature 1
/note="Clone_left_end: RP11-753C18"
gene 3299..43314
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
mRNA join(3299..3467,25043..25145,36943..37104,39645..43314)
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
/product="chromosome 10 open reading frame 26"
/note="match: ESTs: AW006719.1 BF852297.1 BG258775.1
BG720326.1 BQ307980.1 BU537726.1 BU616707.1 BU618493.1
BU784158.1
match: cDNAs: AK000374.1 AK056285.1 BC026369.1 U43563.1"
CDS join(3441..3467,25043..25145,36943..37104,39645..40381)
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
/standard_name="OTTHUMP00000020381"
/note="match: proteins: O95637 P97764 Q8R0W8 Q8WUP5 Q99J20
Q9NX94"
/codon_start=1
/product="chromosome 10 open reading frame 26"
/protein_id="CAI52496.1"
/db_xref="GI:58864791"
/db_xref="UniProtKB/TrEMBL:Q5F2G6"
/translation="MPFLLGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
FWLVWTIIIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
YLLPPYEEVVNRPPTPPPPYSAFQLQQQQLLPPQCGPAGGSPPGIDPTRGSQGAQSSP
LSEPSRSSTRPPSIADPDPSDLPVDRAATKAPGMEPSGSVAGLGELDPGAFLDKDAEC
REELLKDDSSEHGAPDSKEKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFNTLIDDA
LDGPLDFCDSCHVRPPGDEEEGLCQSSEEQAREPGHPHLPRPPACLLLNTINEQDSPN
SQSSSSPS"
misc_feature 28935..28972
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
/note="Sequence from uni-directional dGTP big dye
terminator reads only."
polyA_signal 43292..43297
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
polyA_site 43312
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
polyA_site 43314
/gene="C10orf26"
/locus_tag="RP11-753C18.10-001"
polyA_site 43317
gene complement(join(57581..58035,58558..58661,59561..59730,
60043..60258,61086..61172,61835..62064,62304..62442,
64115..64583))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-001"
mRNA complement(join(57581..58035,58558..58661,59561..59730,
60043..60258,61086..61172,61835..62064,62304..62442,
64115..64583))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-001"
/product="cytochrome P450, family 17, subfamily A,
polypeptide 1"
/note="match: ESTs: AI638522.1 AU120851.1 AV701951.1
AV704321.1 AV704538.1 AV705813.1 AV706535.1 AV706722.1
AV707587.1 AV707827.1 AV709994.1 AV710436.1 AV710723.1
AV711490.1 BM559755.1 BM559917.1
match: cDNAs: AF297650.1 BC032843.1 M14564.1 S75277.1"
polyA_site complement(57581)
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-001"
CDS complement(join(57752..58035,58558..58661,59561..59730,
60043..60258,61086..61172,61835..62064,62304..62442,
64115..64411))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-001"
/standard_name="OTTHUMP00000020382"
/note="match: proteins: P05093 Q95M62 Q9GLD2"
/codon_start=1
/product="cytochrome P450, family 17, subfamily A,
polypeptide 1"
/protein_id="CAI52498.1"
/db_xref="GI:58864793"
/db_xref="InterPro:IPR001128"
/db_xref="InterPro:IPR002401"
/translation="MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL
DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV
PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST
"
gene complement(join(57973..58035,58558..58661,59561..59730,
60043..60206,61086..61172,61835..61891))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-003"
mRNA complement(join(57973..58035,58558..58661,59561..59730,
60043..60206,61086..61172,61835..61891))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-003"
/product="cytochrome P450, family 17, subfamily A,
polypeptide 1"
/note="match: ESTs: AV704788.1 AV705037.1 AV706512.1
AV707587.1 AV708270.1"
gene complement(join(57998..58035,58558..58753))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-004"
mRNA complement(join(57998..58035,58558..58753))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-004"
/product="cytochrome P450, family 17, subfamily A,
polypeptide 1"
/note="match: ESTs: BF737959.1"
gene join(<59771..59921,61227..61566)
/locus_tag="RP11-753C18.3-001"
mRNA join(<59771..59921,61227..61566)
/locus_tag="RP11-753C18.3-001"
/product="novel protein"
/note="match: ESTs: BF529138.1 BG182618.1 BG200234.1
BG201248.1 BG213774.1 BG219080.1 BG220558.1 R16838.1"
CDS join(<59847..59921,61227..61466)
/locus_tag="RP11-753C18.3-001"
/standard_name="OTTHUMP00000020383"
/codon_start=1
/product="novel protein"
/protein_id="CAI52497.1"
/db_xref="GI:58864792"
/db_xref="UniProtKB/TrEMBL:Q5F2G5"
/translation="WLWGPGPASRRWGSSTLGSKPTTAWLPESLLEADLSFLSLEVPT
PPLTFSQNGRQDGDTERVYHDDEDNSKQGRRAWSPPLLPHSIKHQALTLLCVQPLGVR
ET"
gene complement(join(60525..62064,62304..62643,64115..64464))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-002"
mRNA complement(join(60525..62064,62304..62643,64115..64464))
/gene="CYP17A1"
/locus_tag="RP11-753C18.1-002"
/product="cytochrome P450, family 17, subfamily A,
polypeptide 1"
/note="match: cDNAs: AK094106.1"
polyA_signal 61531..61536
/locus_tag="RP11-753C18.3-001"
polyA_site 61558
/locus_tag="RP11-753C18.3-001"
polyA_site 61566



  Query Match             73.1%;  Score 789.8;  DB 5;  Length 131753;
  Best Local Similarity   99.7%;  Pred. No. 3e-223;
  Matches  791;  Conservative    0;  Mismatches    2;  Indels    0;  Gaps    0;

Qy        288 TTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 347
              |||  |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      39640 TTTAGGGTTTTTGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 39699

Qy        348 AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 407



SCORE Search Results Details for Application
10729895 and Search Result
20070109_162355_us-10-729-895a-1.rst.




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: January 11, 2007, 01:10:13 ; Search time 5925 Seconds 

(without alignments) 
10192.891 Million cell updates/sec 

Title: US-10-729-895A-1

Perfect score: 1080 

Sequence: 1 atgcctttccttttgggtct aaagcaaccagggtagggga 1080 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 48236798 seqs, 27959665780 residues 

Total number of hits satisfying chosen parameters: 96473596 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : EST:*

 1: gb_est1:*
 2: gb_est3:*
 3: gb_est4:*
 4: gb_est5:*
 5: gb_est6:*
 6: gb_htc:*
 7: gb_est2:*
 8: gb_est7:*
 9: gb_est8:*
10: gb_est9:*
11: gb_gss1:*
12: gb_gss2:*
13: gb_gss3:*
14: gb_gss4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                 %
Result         Query
   No.   Score Match Length DB ID         Description
     1  1027.8  95.2   2792  6 CR859701   CR859701 Pongo pyg
     2     786  72.8   2603  6 AK052400   AK052400 Mus muscu
     3     786  72.8   2702  6 AK078491   AK078491 Mus muscu
     4     786  72.8   3871  6 AK142695   AK142695 Mus muscu
     5   766.8  71.0   3893  6 AK154686   AK154686 Mus muscu
     6     758  70.2   3873  6 AK087487   AK087487 Mus muscu
     7   708.2  65.6    735  9 DB293556   DB293556 DB293556
     8     683  63.2    800  8 CX163170   CX163170 HESC2_2_F
     9   660.8  61.2   3955  6 AK169294   AK169294 Mus muscu
    10     587  54.4    587  9 DA190287   DA190287 DA190287
    11     578  53.5    712  8 CN427264   CN427264 170005321
    12     575  53.2    575  9 DA553446   DA553446 DA553446
    13   571.6  52.9    581  3 BP242855   BP242855 BP242855
    14     557  51.6    565  3 BP221694   BP221694 BP221694
    15     550  50.9    550  9 DA576326   DA576326 DA576326
    16     550  50.9    550  9 DB283277   DB283277 DB283277
    17     541  50.1    541  9 DB133804   DB133804 DB133804
    18   531.6  49.2    852  9 DN935380   DN935380 AGENCOURT
    19   526.4  48.7    876  9 DN948780   DN948780 AGENCOURT
    20   519.4  48.1    566  9 DA806792   DA806792 DA806792
    21     512  47.4   1006  3 BQ054184   BQ054184 AGENCOURT
    22   510.4  47.3    680  8 CV868359   CV868359 PDUtsl054
    23   497.4  46.1    796  5 CF737347   CF737347 UI-M-HD0-
    24   496.4  46.0    553  9 DA760493   DA760493 DA760493
    25   493.4  45.7    560  9 DA756667   DA756667 DA756667
    26   493.4  45.7    762  4 CA316710   CA316710 UI-M-FW0-
    27     488  45.2    488  9 DA596718   DA596718 DA596718
    28   466.4  43.2    563  9 DA321605   DA321605 DA321605
    29   459.2  42.5    844  8 CV118164   CV118164 AGENCOURT
    30   454.8  42.1    525  9 CX597272   CX597272 CT020017B
    31     450  41.7    450 14 AY405369   AY405369 Homo sapi
    32   447.6  41.4    625  4 CA334353   CA334353 NISC ls08
    33     444  41.1    537  9 DB129660   DB129660 DB129660
    34   442.4  41.0    837  3 BQ924256   BQ924256 AGENCOURT
    35     437  40.5    580  9 DA823881   DA823881 DA823881
    36   430.8  39.9    434  4 CB145438   CB145438 K-EST0200
    37   428.4  39.7    850  9 DB292857   DB292857 DB292857
    38     425  39.4    565  9 DA196239   DA196239 DA196239
    39   423.2  39.2    557  9 DA860698   DA860698 DA860698
    40     417  38.6    556  9 DB226414   DB226414 DB226414
    41   415.2  38.4    551  9 DA887248   DA887248 DA887248
    42     414  38.3    553  9 DA723361   DA723361 DA723361
    43   410.2  38.0    716  3 BU701616   BU701616 UI-M-FI0-
    44     410  38.0    410  1 AI459545   AI459545 ao86gll.x
    45   409.4  37.9    580  9 DN747882   DN747882 GL-Cf-467
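
The Query Match column in the summary above appears to be each hit's score expressed as a percentage of the perfect score (1080 for this query). That relationship is inferred from the listed values and is not stated by GenCore itself. A minimal Python sketch, with a hypothetical helper name, that reproduces a few of the listed percentages:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score, rounded to one decimal.

    Assumption: this is how the report's '% Query Match' column is derived;
    the report itself does not spell out the formula.
    """
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 1080  # "Perfect score" from the report header

# (score, reported % Query Match) pairs copied from the summary table
rows = [(1027.8, 95.2), (786, 72.8), (766.8, 71.0), (409.4, 37.9)]

for score, reported in rows:
    computed = query_match_percent(score, PERFECT_SCORE)
    print(f"score {score:>7}: computed {computed}%, reported {reported}%")
```

For the four rows shown, the computed and reported values agree.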



ALIGNMENTS 



RESULT 1
CR859701
LOCUS       CR859701                2792 bp    mRNA    linear   HTC 12-NOV-2004
DEFINITION  Pongo pygmaeus mRNA; cDNA DKFZp469N2325 (from clone DKFZp469N2325).
ACCESSION   CR859701
VERSION     CR859701.1  GI:55730274
KEYWORDS    HTC.
SOURCE      Pongo pygmaeus (orangutan)
  ORGANISM  Pongo pygmaeus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Pongo.
REFERENCE   1  (bases 1 to 2792)
  AUTHORS   Wambutt,R., Heubner,D., Mewes,H.W., Weil,B., Amid,C., Osanger,A.,
            Fobo,G., Han,M. and Wiemann,S.
  CONSRTM   The German cDNA Consortium
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-2004) MIPS, Ingolstaedter Landstr.1, D-85764
            Neuherberg, GERMANY
COMMENT     Clone from S. Wiemann, Molecular Genome Analysis, German Cancer
            Research Center (DKFZ); Email s.wiemann@dkfz-heidelberg.de;
            sequenced by Agowa (Berlin/Germany) within the cDNA sequencing
            consortium of the German Genome Project.
            This clone (DKFZp469N2325) is available at the RZPD Deutsches
            Ressourcenzentrum fuer Genomforschung GmbH in Berlin, Germany.
            Please contact RZPD for ordering:
            http://www.rzpd.de/cgi-bin/products/cl.cgi?CloneID=DKFZp469N2325
            Further information about the clone and the sequencing project is
            available at http://mips.gsf.de/projects/cdna/.
FEATURES             Location/Qualifiers
     source          1..2792
                     /organism="Pongo pygmaeus"
                     /mol_type="mRNA"
                     /db_xref="taxon:9600"
                     /clone="DKFZp469N2325"
                     /tissue_type="kidney"
                     /clone_lib="469 (synonym: pkidl). Vector pSport1_Sfi; host
                     DH10B; sites SfiIA + SfiIB"
                     /dev_stage="adult"
                     /note="hypothetical protein (Mus musculus), differentially
                     spliced"
     gene            1..2792
                     /gene="DKFZp469N2325"
     CDS             43..360
                     /gene="DKFZp469N2325"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="CAH91860.1"
                     /db_xref="GI:55730275"
                     /translation="MERRRLLGGMALLLLQALPSPLSARAEPPQLFGGASCGSLSCVW
                     LTTPALLVQYWGFQGRIRKPVWVPTIKATSVTQDTAVDSLSAATTTMNSGGSGWCGPS
                     SSS"
ORIGIN

  Query Match             95.2%;  Score 1027.8;  DB 6;  Length 2792;
  Best Local Similarity   97.5%;  Pred. No. 1.3e-255;
  Matches 1044;  Conservative    0;  Mismatches   27;  Indels    0;  Gaps    0;

Qy          8 TCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACA 67
              | ||   || |    | |||||||||||||||||||||||||||||||||||||||||||
Db        202 TACTGGGGGTTTCAGGGCAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAAAGCTACA 261

Qy         68 TCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCT 127
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        262 TCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTATGAACTCT 321

Qy        128 GGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACC 187
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        322 GGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTTTGCCACC 381

Qy        188 ACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCG 247
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        382 ACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAACCTGATCG 441

Qy        248 CTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACT 307
              | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        442 CCTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTGCCAAACT 501

Qy        308 ATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACA 367
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        502 ATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCACCATACA 561

Qy        368 GTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCA 427
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        562 GTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTGCAGGTGGCA 621

Qy        428 GTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTG 487
              ||||||||||| | ||||||||||||||||||||||||||||||||||||||||||||||
Db        622 GTCCCCCGGGCGTTGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCCCCTTGTCTG 681

Qy        488 AGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGATCCCTCTGACCTAC 547
               ||||||||||||||||||||||||||||||||||||||||||||| |||||||||||||
Db        682 GGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGACCCTGAGCCCTCTGACCTAC 741

Qy        548 CAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCCAGTGGCTCTGTGGCTGGCC 607
              ||||||| |||||||||||||||||||| |||||||||||||||||||||||||||||||
Db        742 CAGTTGAACGAGCAGCCACCAAAGCCCCGGGGATGGAGCCCAGTGGCTCTGTGGCTGGCC 801

Qy        608 TGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGGAGGAGCTGC 667
              |||||||||||||||||||||||||||||||||| |||||||||||||||||||||||||
Db        802 TGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAGGATGCAGAATGTAGGGAGGAGCTGC 861

Qy        668 TGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGAC 727
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        862 TGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAAAGAGAAGACGCCTGGGAGAC 921

Qy        728 ATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATG 787
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        922 ATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGTGTGCAACCGGGGCCACCATG 981

Qy        788 ACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGCCCCTGGACT 847
              ||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||||
Db        982 ACGATGACCTCAAAGAGTTCAACACGCTCATCGATGATGCTCTGGATGGGCCCCTGGACT 1041

Qy        848 TCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCTGTCAGTCCT 907
              ||||||| |||||||||||||||||||||||||| ||||||||||||||||| ||| |||
Db       1042 TCTGCGATAGCTGCCATGTGCGGCCCCCTGGTGACGAGGAGGAAGGCCTCTGCCAGCCCT 1101

Qy        908 CTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGC 967
              | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1102 CCGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCGCATGCCTGC 1161

Qy        968 TGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCT 1027
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1162 TGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCAGAGCAGCAGCTCCCCCAGCT 1221

Qy       1028 AGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGG 1078
              |||||||| ||||||||||||||| |||||||||||||||||||||| |||
Db       1222 AGAGCAGGCCCTGCCAGCACCCAGGAACTTGGCAAAGCAACCAGGGTGGGG 1272
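
The Matches, Mismatches and Indels figures reported for each result can be re-derived from the aligned Qy and Db rows. The sketch below is an illustration only (the function name is hypothetical and the code is not part of GenCore); it tallies one pair of aligned rows column by column, treating a `-` in either row as an indel position, and is applied to the last alignment block of RESULT 1.

```python
def alignment_counts(qy: str, db: str) -> tuple[int, int, int]:
    """Return (matches, mismatches, indels) for two equal-length aligned rows.

    A '-' in either row is counted as one indel position.
    """
    if len(qy) != len(db):
        raise ValueError("aligned rows must have equal length")
    matches = mismatches = indels = 0
    for q, d in zip(qy.upper(), db.upper()):
        if q == "-" or d == "-":
            indels += 1
        elif q == d:
            matches += 1
        else:
            mismatches += 1
    return matches, mismatches, indels


# Last alignment block of RESULT 1 (Qy 1028-1078 against Db 1222-1272)
qy = "AGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGGTAGGG"
db = "AGAGCAGGCCCTGCCAGCACCCAGGAACTTGGCAAAGCAACCAGGGTGGGG"
print(alignment_counts(qy, db))  # (48, 3, 0)
```

Summing the per-block tuples over all blocks of a result should give the totals printed in that result's statistics line.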



RESULT 2
AK052400
LOCUS       AK052400                2603 bp    mRNA    linear   HTC 02-SEP-2005
DEFINITION  Mus musculus 13 days embryo lung cDNA, RIKEN full-length enriched
            library, clone:D430003C10 product:hypothetical Proline-rich
            region/Cytochrome c family heme-binding site containing protein,
            full insert sequence.
ACCESSION   AK052400
VERSION     AK052400.1  GI:26342634
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
   PUBMED   11076861
REFERENCE   4
  AUTHORS   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   5
  AUTHORS   The FANTOM Consortium, the RIKEN Genome Exploration Research Group
            Phase I and II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6
  AUTHORS   RIKEN Genome Exploration Research Group, Genome Science Group
            (Genome Network Core Team) and the FANTOM Consortium.
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   7
  AUTHORS   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group).
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   8  (bases 1 to 2603)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa, 230-0045, Japan (E-mail:genome-res@gsc.riken.jp,
            URL:http://genome.gsc.riken.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://genome.gsc.riken.jp/
            URL:http://fantom.gsc.riken.jp/.
FEATURES             Location/Qualifiers
     source          1..2603
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D430003C10"
                     /db_xref="taxon:10090"
                     /clone="D430003C10"
                     /tissue_type="lung"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="13 days embryo"
     CDS             143..1189
                     /note="unnamed protein product; hypothetical Proline-rich
                     region/Cytochrome c family heme-binding site containing
                     protein (InterPro|IPR000694 InterPro|IPR000345, evidence:
                     InterPro)
                     putative"
                     /codon_start=1
                     /protein_id="BAC34974.1"
                     /db_xref="GI:26342635"
                     /translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
                     FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
                     SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
                     QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ
                     DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN
                     TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN
                     EQDSPNSQHSGSPS"
     polyA_signal    2584..2589
                     /note="putative"
     polyA_site      2603
                     /note="putative"
ORIGIN



  Query Match             72.8%;  Score 786;  DB 6;  Length 2603;
  Best Local Similarity   85.0%;  Pred. No. 9.6e-193;
  Matches  933;  Conservative    0;  Mismatches  145;  Indels   20;  Gaps    4;

Qy          1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
              ||||||||||| | ||| ||||||||||||||||||||||||||||||||||||||||||
Db        143 ATGCCTTTCCTCTGGGGGCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 202

Qy         61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        203 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 262

Qy        121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
              |||||||||||||||||||||||||||||| || ||||||| |||||||||||||||||
Db        263 GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 322

Qy        181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
              ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Db        323 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 382

Qy        241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
              |||||||||||||| ||||||||||| ||||||||| |||| ||||| ||||||||||||
Db        383 CTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTACTTCAGGTTTTTG 442

Qy        301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
              ||||||| ||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db        443 CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 502

Qy        361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTG---------CCTCCACAGTGT 411
              || ||||||||||||||||| || |||||||||| ||||         ||||| ||| ||
Db        503 CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 562

Qy        412 GGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA---TCCCACCAGGGGATCCCAGGGGGCA 468
              |||||| |||||||||| ||||| |||   ||    ||  ||  ||| |||||||| ||
Db        563 GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 622

Qy        469 CAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGAC 528
              |||||||| || ||||| |  || ||| |||||||||||||||| |||||  |||| ||
Db        623 CAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCAAGTGTCGCAGAT 682

Qy        529 CCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCC 588
              ||| |  || ||||  | ||   |||||| | ||||||||||||| | |||| |||| |
Db        683 CCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCTGGGACGGAGTCT 742

Qy        589 AGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCA 648
               || || |  |||| ||||  ||||||||||||||||| |||||||||||| | ||| |
Db        743 GGTAGCCCCATGGCAGGCCACGGGGAGCTGGACCCGGGCGCCTTCCTGGACCAGGATTCC 802

Qy        649 GAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAACACGGCG------CACCCGAC 702
              || || | ||||||||| |||||||||    | ||||| |  ||||      | || |||
Db        803 GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 862

Qy        703 AGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTG 762
              ||  |||| ||||| || || || ||||||||||||||||||||||| |||||||| |||
Db        863 AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 922

Qy        763 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGAT 822
              |||||||||||||||||||||||||||||||||||||||||||| ||||| ||||| |||
Db        923 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 982

Qy        823 GATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGAT 882
              ||||||||||| ||||||||||||||||| |||||||| ||||| ||||| |||| ||||
Db        983 GATGCTCTGGACGGGCCCCTGGACTTCTGTGACAGCTGTCATGTACGGCCTCCTGTTGAT 1042

Qy        883 GAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCAC 942
              || |||||||| ||||| | |||||| ||||  |||||||| |||| ||||||||| |||
Db       1043 GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 1102

Qy        943 CTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAAC 1002
              |||||||||||||| || || |||||||| ||||||||||| |||||||||||||| ||
Db       1103 CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 1162

Qy       1003 TCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAA 1062
              ||||||  |||  ||||||| ||||||  || | |||||| ||||| || | ||   | |
Db       1163 TCCCAGCACAGTGGCTCCCCGAGCTAGGACACGCCCTGCCTGCACCTAGGAGCTCAACGA 1222

Qy       1063 AGCAACCAGGGTAGGGGA 1080
                ||||||||||  ||||
Db       1223 --CAACCAGGGTCAGGGA 1238
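
The reported scores are numerically consistent with a simple linear model: +1 per match, a 0.6 penalty per mismatch, the Gapop value (10.0) charged once per gap, and the Gapext value (1.0) charged per indel position. The 0.6 mismatch penalty and this reading of the gap parameters are inferences from the numbers in this report, not documented GenCore behaviour, so the sketch below (hypothetical function name) should be read as a consistency check rather than a description of the scoring engine.

```python
def inferred_score(matches: int, mismatches: int, gaps: int, indels: int,
                   mismatch_penalty: float = 0.6,  # inferred, not documented
                   gap_open: float = 10.0,         # "Gapop" from the header
                   gap_ext: float = 1.0) -> float:  # "Gapext" from the header
    """Score under the assumed model, rounded to one decimal place."""
    raw = (matches
           - mismatch_penalty * mismatches
           - gap_open * gaps
           - gap_ext * indels)
    return round(raw, 1)


# (matches, mismatches, gaps, indels, reported score) taken from the report
reported = [
    (791, 2, 0, 0, 789.8),     # genomic hit, Length 131753
    (1044, 27, 0, 0, 1027.8),  # RESULT 1, CR859701
    (933, 145, 4, 20, 786.0),  # RESULT 2, AK052400
]

for m, x, g, i, score in reported:
    print(inferred_score(m, x, g, i), "vs reported", score)
```

All three reported scores are reproduced under these assumptions.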
RESULT 3
AK078491
LOCUS       AK078491                2702 bp    mRNA    linear   HTC 02-SEP-2005
DEFINITION  Mus musculus 12 days embryo female mullerian duct includes
            surrounding region cDNA, RIKEN full-length enriched library,
            clone:6820415N12 product:hypothetical Proline-rich
            region/Cytochrome c family heme-binding site containing protein,
            full insert sequence.
ACCESSION   AK078491
VERSION     AK078491.1  GI:26347312
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10 (10), 1617-1630 (2000)
   PUBMED   11042159
REFERENCE   3
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system--384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10 (11), 1757-1771 (2000)
   PUBMED   11076861
REFERENCE   4
  AUTHORS   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium.
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   5
  AUTHORS   The FANTOM Consortium, the RIKEN Genome Exploration Research Group
            Phase I and II Team.
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   6
  AUTHORS   RIKEN Genome Exploration Research Group, Genome Science Group
            (Genome Network Core Team) and the FANTOM Consortium.
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   7
  AUTHORS   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group).
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   8  (bases 1 to 2702)
  AUTHORS   Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P.,
            Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W.,
            Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T.,
            Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T.,
            Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M.,
            Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N.,
            Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N.,
            Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
            Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S.,
            Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission

  JOURNAL   Submitted (16-APR-2002) Yoshihide Hayashizaki, The Institute of
            Physical and Chemical Research (RIKEN), Laboratory for Genome
            Exploration Research Group, RIKEN Genomic Sciences Center (GSC),
            RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama,
            Kanagawa, 230-0045, Japan (E-mail:genome-res@gsc.riken.jp,
            URL:http://genome.gsc.riken.jp/, Tel:81-45-503-9222,
            Fax:81-45-503-9216)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://genome.gsc.riken.jp/
            URL:http://fantom.gsc.riken.jp/.
FEATURES             Location/Qualifiers
     source          1..2702
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:6820415N12"
                     /db_xref="taxon:10090"
                     /clone="6820415N12"
                     /sex="female"
                     /tissue_type="mullerian duct includes surrounding region"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="12 days embryo"
     CDS             154..1200
                     /note="unnamed protein product; hypothetical Proline-rich
                     region/Cytochrome c family heme-binding site containing
                     protein (InterPro|IPR000694 InterPro|IPR000345, evidence:
                     InterPro)
                     putative"
                     /codon_start=1
                     /protein_id="BAC37305.1"
                     /db_xref="GI:26347313"
                     /translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
                     FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
                     SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
                     QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ
                     DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN
                     TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN
                     EQDSPNSQHSGSPS"
ORIGIN



Query Match 72.8%; Score 786; DB 6; Length 2702; 

Best Local Similarity 85.0%; Pred. No. 9.7e-193; 

Matches 933; Conservative 0; Mismatches 145; Indels 20; Gaps 4; 



Qy          1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
              ||||||||||| | ||| ||||||||||||||||||||||||||||||||||||||||||
Db        154 ATGCCTTTCCTCTGGGGGCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 213

Qy         61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        214 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 273

Qy        121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
              |||||||||||||||||||||||||||||| || ||||||| |||||||||||||||||
Db        274 GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 333

Qy        181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
              ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Db        334 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 393

Qy        241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
              |||||||||||||| ||||||||||| ||||||||| |||| ||||| ||||||||||||
Db        394 CTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTACTTCAGGTTTTTG 453

Qy        301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
              ||||||| ||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db        454 CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 513

Qy        361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTG---------CCTCCACAGTGT 411
              || ||||||||||||||||| || |||||||||| ||||         ||||| ||| ||
Db        514 CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 573

Qy        412 GGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA---TCCCACCAGGGGATCCCAGGGGGCA 468
              |||||| |||||||||| ||||| |||   ||    ||  ||  ||| |||||||| ||
Db        574 GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 633

Qy        469 CAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGAC 528
              |||||||| || ||||| |  || ||| |||||||||||||||| |||||  |||| ||
Db        634 CAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCAAGTGTCGCAGAT 693

Qy        529 CCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCC 588
              ||| |  || ||||  | ||   |||||| | ||||||||||||| | |||| |||| |
Db        694 CCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCTGGGACGGAGTCT 753

Qy        589 AGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCA 648
               || || |  |||| ||||  ||||||||||||||||| |||||||||||| | ||| |
Db        754 GGTAGCCCCATGGCCGGCCACGGGGAGCTGGACCCGGGCGCCTTCCTGGACCAGGATTCC 813

Qy        649 GAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAACACGGCG------CACCCGAC 702
              || || | ||||||||| |||||||||    | ||||| |  ||||      | || |||
Db        814 GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 873

Qy        703 AGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTG 762
              ||  |||| ||||| || || || ||||||||||||||||||||||| |||||||| |||
Db        874 AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 933

Qy        763 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGAT 822
              |||||||||||||||||||||||||||||||||||||||||||| ||||| ||||| |||
Db        934 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 993

Qy 823 GATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGAT 882 

I I I I I I I I I I I I I I I I I I I II I I II I I I I II II I I I I I I I I I I I I I II I I MM 

Db 994 GAT GCT CT GGAC GGGC CC CT GGACTT CT GT GAC AGCT GT CAT GT ACGGCCT C CT GT T GAT 1053 

Qy 883 GAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCAC 942 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml 

Db 1054 GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 1113 

Qy .94 3 CTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAAC 1002 

I II I I I M M I I I I II II I I I I I I I I I I I I I I II I I I I II I II I I I I I I I I II 
Db 1114 CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 117 3 

Qy 1003 TCCC AGAGCAGC AGCT CCCC CAGCT AGAGC AGGT CCTGCC AGCACCCAGC AACT T GGC AA 1062 

I I I I I I III I I I M M I I I II I I I I I I I I I I M I I I II I I I II 
Db 117 4 TCCCAGCACAGTGGCTCCCCGAGCTAGGACACGCCCTGCCTGCACCTAGGAGCTCAACGA 1233 

Qy 1063 AGCAACCAGGGTAGGGGA 1080 

I I I I I I I I I I I I I I 
Db 1234 — CAACCAGGGT CAGGGA 124 9 



RESULT 4 
AK142695 
LOCUS       AK142695                3871 bp    mRNA    linear   HTC 21-SEP-2005
DEFINITION  Mus musculus 16 days neonate heart cDNA, RIKEN full-length enriched
            library, clone:D830007O22 product:hypothetical protein, full insert
            sequence.
ACCESSION   AK142695
VERSION     AK142695.1  GI:74216502
KEYWORDS    HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia;
            Sciurognathi; Muroidea; Muridae; Murinae; Mus.
REFERENCE   1
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
   PUBMED   10349636
REFERENCE   2
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D830007O22"
                     /db_xref="taxon:10090"
                     /clone="D830007O22"
                     /tissue_type="heart"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="16 days neonate"
     CDS             154..1200
                     /note="unnamed protein product; hypothetical protein
                     (evidence: CDS)
                     putative"
                     /codon_start=1
                     /protein_id="BAE25164.1"
                     /db_xref="GI:74216503"
                     /translation="MPFLWGLRQDKEACVGTNNQSYICDTGHCCGQSQCCNYYYELWW
                     FWLVWTVVIILSCCCVCHHRRAKHRLQAQQRQHEINLIAYREAHNYSALPFYFRFLPN
                     SLLPPYEEVVNRPPTPPPPYSAFQLQQQQQLLPPPPQGGPPGGSPPGADPPPQGSQGA
                     QSSPLSGPSRSSTRPPSVADPQSPEVPTDREATKASGTESGSPMAGHGELDPGAFLDQ
                     DSECKEELLKDSRSERGGVSPDSEDKTPGRHRRFTGDSGIEVCVCNRGHHDDDLKEFN
                     TLIDDALDGPLDFCDSCHVRPPVDEEEGLCLSSEGQAREHGHPHLPRPPACLLLNTIN
                     EQDSPNSQHSGSPS"
     polyA_signal    3846..3851
                     /note="putative"
     polyA_site      3871
                     /note="putative"

Query Match 72.8%; Score 786; DB 6; Length 3871; 

Best Local Similarity 85.0%; Pred. No. 1.1e-192;

Matches 933; Conservative 0; Mismatches 145; Indels 20; Gaps 4; 



Qy      1 ATGCCTTTCCTTTTGGGTCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 60
          ||||||||||| | ||| ||||||||||||||||||||||||||||||||||||||||||
Db    154 ATGCCTTTCCTCTGGGGGCTTAGACAGGATAAGGAAGCCTGTGTGGGTACCAACAATCAA 213

Qy     61 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    214 AGCTACATCTGTGACACAGGACACTGCTGTGGACAGTCTCAGTGCTGCAACTACTACTAT 273

Qy    121 GAACTCTGGTGGTTCTGGCTGGTGTGGACCATCATCATCATCCTGAGCTGCTGCTGTGTT 180
          |||||||||||||||||||||||||||||| || ||||||| |||||||||||||||||
Db    274 GAACTCTGGTGGTTCTGGCTGGTGTGGACCGTCGTCATCATTCTGAGCTGCTGCTGTGTC 333

Qy    181 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAAC 240
          ||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||
Db    334 TGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCTCAGCAGCGGCAACATGAAATCAAC 393

Qy    241 CTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTATTTCAGGTTTTTG 300
          |||||||||||||| ||||||||||| ||||||||| |||| ||||| ||||||||||||
Db    394 CTGATCGCTTACCGGGAAGCCCACAACTACTCAGCGTTGCCGTTTTACTTCAGGTTTTTG 453

Qy    301 CCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 360
          ||||||| ||| ||||||||||||||||||||||||||||||||||||||||||||||||
Db    454 CCAAACTCTTTGCTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCCCA 513

Qy    361 CCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTG---------CCTCCACAGTGT 411
          || ||||||||||||||||| || |||||||||| ||||         ||||| ||| ||
Db    514 CCGTACAGTGCCTTCCAGCTCCAACAGCAGCAGCAGCTGCTGCCACCTCCTCCTCAGGGT 573

Qy    412 GGCCCTGCAGGTGGCAGTCCCCCGGGCATCGA---TCCCACCAGGGGATCCCAGGGGGCA 468
          |||||| |||||||||| ||||| |||   ||    ||  ||  ||| |||||||| ||
Db    574 GGCCCTCCAGGTGGCAGCCCCCCAGGCGCTGACCCGCCGCCCCAGGGCTCCCAGGGAGCG 633

Qy    469 CAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGAC 528
          |||||||| || ||||| |  || ||| |||||||||||||||| |||||  |||| ||
Db    634 CAGAGCAGTCCTTTGTCCGGACCGAGCCGAAGCAGCACAAGACCTCCAAGTGTCGCAGAT 693

Qy    529 CCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCCC 588
          ||| |  || ||||  | ||   |||||| | ||||||||||||| | |||| |||| |
Db    694 CCTCAGTCCCCTGAAGTGCCCACTGACCGGGAAGCCACCAAAGCCTCTGGGACGGAGTCT 753

Qy    589 AGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCA 648
           || || |  |||| ||||  ||||||||||||||||| |||||||||||| | ||| |
Db    754 GGTAGCCCCATGGCCGGCCACGGGGAGCTGGACCCGGGCGCCTTCCTGGACCAGGATTCC 813

Qy    649 GAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAACACGGCG------CACCCGAC 702
          || || | ||||||||| |||||||||    | ||||| |  ||||      | || |||
Db    814 GAGTGCAAGGAGGAGCTACTGAAAGATTCGCGTTCTGAGCGTGGCGGCGTGTCCCCTGAC 873

Qy    703 AGCAAAGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTG 762
          ||  |||| ||||| || || || ||||||||||||||||||||||| |||||||| |||
Db    874 AGTGAAGACAAGACCCCCGGCAGGCATCGCCGCTTCACAGGTGACTCAGGCATTGAGGTG 933

Qy    763 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGAT 822
          |||||||||||||||||||||||||||||||||||||||||||| ||||| ||||| |||
Db    934 TGTGTGTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTTAACACGCTCATAGAT 993

Qy    823 GATGCTCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGAT 882
          ||||||||||| ||||||||||||||||| |||||||| ||||| ||||| |||| ||||
Db    994 GATGCTCTGGACGGGCCCCTGGACTTCTGTGACAGCTGTCATGTACGGCCTCCTGTTGAT 1053

Qy    883 GAGGAGGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCAC 942
          || |||||||| ||||| | |||||| ||||  |||||||| |||| ||||||||| |||
Db   1054 GAAGAGGAAGGTCTCTGCCTGTCCTCGGAGGGTCAGGCTCGGGAGCATGGGCACCCCCAC 1113

Qy    943 CTGCCACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAAC 1002
          |||||||||||||| || || |||||||| ||||||||||| |||||||||||||| ||
Db   1114 CTGCCACGGCCGCCTGCGTGTCTGCTGCTAAACACCATCAATGAGCAGGACTCTCCAAAT 1173

Qy   1003 TCCCAGAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAA 1062
          ||||||  |||  ||||||| ||||||  || | |||||| ||||| || | ||   | |
Db   1174 TCCCAGCACAGTGGCTCCCCGAGCTAGGACACGCCCTGCCTGCACCTAGGAGCTCAACGA 1233

Qy   1063 AGCAACCAGGGTAGGGGA 1080
            ||||||||||  ||||
Db   1234 --CAACCAGGGTCAGGGA 1249
AK154686 3893 bp mRNA linear HTC 21-SEP-2005

Mus musculus NOD-derived CD11c +ve dendritic cells cDNA, RIKEN
full-length enriched library, clone:F630103G18 product:hypothetical
protein, full insert sequence.
AK154686

AK154686.1 GI:74185772
Mus musculus (house mouse)
Genome Res. 10 (10), 1617-1630 (2000)
Saito,T., Okazaki,Y., Gojobori,T., Bono,H., Kasukawa,T., Saito,R.,
Kadota,K., Matsuda,H., Ashburner,M., Batalov,S., Casavant,T.,
Fleischmann,W., Gaasterland,T., Gissi,C., King,B., Kochiwa,H.,
Kuehl,P., Lewis,S., Matsuo,Y., Nikaido,I., Pesole,G.,
Quackenbush,J., Schriml,L.M., Staubli,F., Suzuki,R., Tomita,M.,
Wagner,L., Washio,T., Sakai,K., Okido,T., Furuno,M., Aono,H.,
Baldarelli,R., Barsh,G., Blake,J., Boffelli,D., Bojunga,N.,
Carninci,P., de Bonaldo,M.F., Brownstein,M.J., Bult,C.,
Fletcher,C., Fujita,M., Gariboldi,M., Gustincich,S., Hill,D.,
Hofmann,M., Hume,D.A., Kamiya,M., Lee,N.H., Lyons,P.,
Marchionni,L., Mashima,J., Mazzarelli,J., Mombaerts,P., Nordone,P.,
Ring,B., Ringwald,M., Rodriguez,I., Sakamoto,N., Sasaki,H.,
Sato,K., Schonbach,C., Seya,T., Shibata,Y., Storch,K.F., Suzuki,H.,
Toyo-oka,K., Wang,K.H., Weitz,C., Whittaker,C., Wilming,L.,
Wynshaw-Boris,A., Yoshida,K., Hasegawa,Y., Kawaji,H., Kohtsuki,S.
and Hayashizaki,Y.

  CONSRTM   RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409 (6821), 685-690 (2001)
   PUBMED   11217851
REFERENCE   5
  AUTHORS   Okazaki,Y., Furuno,M., Kasukawa,T., Adachi,J., Bono,H., Kondo,S.,
            Nikaido,I., Osato,N., Saito,R., Suzuki,H., Yamanaka,I.,
            Kiyosawa,H., Yagi,K., Tomaru,Y., Hasegawa,Y., Nogami,A.,
            Schonbach,C., Gojobori,T., Baldarelli,R., Hill,D.P., Bult,C.,
            Hume,D.A., Quackenbush,J., Schriml,L.M., Kanapin,A., Matsuda,H.,
            Batalov,S., Beisel,K.W., Blake,J.A., Bradt,D., Brusic,V.,
            Chothia,C., Corbani,L.E., Cousins,S., Dalla,E., Dragani,T.A.,
            Fletcher,C.F., Forrest,A., Frazer,K.S., Gaasterland,T.,
            Gariboldi,M., Gissi,C., Godzik,A., Gough,J., Grimmond,S.,
            Gustincich,S., Hirokawa,N., Jackson,I.J., Jarvis,E.D., Kanai,A.,
            Kawaji,H., Kawasawa,Y., Kedzierski,R.M., King,B.L., Konagaya,A.,
            Kurochkin,I.V., Lee,Y., Lenhard,B., Lyons,P.A., Maglott,D.R.,
            Maltais,L., Marchionni,L., McKenzie,L., Miki,H., Nagashima,T.,
SCORE Search Results Details for Application 10729895 and Search Result
20070109_162406_us-10-729-895a-1.rnpbn.




GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: January 11, 2007, 02:13:47 ; Search time 380 Seconds

(without alignments)
7745.241 Million cell updates/sec

Title: US-10-729-895A-1
Perfect score: 1080

Sequence: 1 atgcctttccttttgggtct aaagcaaccagggtagggga 1080

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0



Searched: 3650718 seqs, 1362588608 residues 

Total number of hits satisfying chosen parameters: 7301436 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA_New:*

1: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US09_NEW_PUB.seq:*
2: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US06_NEW_PUB.seq:*
3: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_NEW_PUB.seq:*
4: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_NEW_PUB.seq:*
5: /EMC_Celerra_SIDS3/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
6: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq:*
7: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq:*
8: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq1:*
9: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq2:*
10: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US11_NEW_PUB.seq3:*
11: /EMC_Celerra_SIDS3/ptodata/2/pubpna/US60_NEW_PUB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                   %
Result           Query
   No.   Score   Match   Length  DB  ID                      Description

     1   789.8    73.1    39315   7  US-11-033-056A-36264    Sequence 36264, A
c    2   430.2    39.8    35944   8  US-11-266-748A-59993    Sequence 59993, A
c    3   430.2    39.8   341983   7  US-11-033-056A-37870    Sequence 37870, A
c    4   430.2    39.8   684973   8  US-11-266-748A-32013    Sequence 32013, A
c    5   366.6    33.9      420   8  US-11-266-748A-350361   Sequence 350361,
     6   366.6    33.9      420   8  US-11-266-748A-433740   Sequence 433740,
     7   350.4    32.4      352   8  US-11-266-748A-33938    Sequence 33938, A
     8   330.4    30.6      435   6  US-10-513-369-5134      Sequence 5134, Ap
     9     224    20.7   138251   6  US-10-541-749-96        Sequence 96, Appl
    10   123.2    11.4      736   8  US-11-266-748A-193810   Sequence 193810,
    11   123.2    11.4      736   8  US-11-266-748A-227330   Sequence 227330,
    12   123.2    11.4      810   7  US-11-371-354-58590     Sequence 58590, A
    13   123.2    11.4     1000   8  US-11-266-748A-116884   Sequence 116884,
c   14   123.2    11.4     1000   8  US-11-266-748A-159048   Sequence 159048,
c   15   123.2    11.4     1166   8  US-11-266-748A-73679    Sequence 73679, A
c   16   123.2    11.4     1166   8  US-11-266-748A-107945   Sequence 107945,
    17   123.2    11.4     1166   8  US-11-266-748A-126490   Sequence 126490,
    18   123.2    11.4     2047   8  US-11-293-697-107       Sequence 107, App
    19    80.4     7.4      861   7  US-11-478-144-379       Sequence 379, App
    20      63     5.8       63   7  US-11-511-035-48751     Sequence 48751, A
    21      48     4.4      510   8  US-11-266-748A-73680    Sequence 73680, A
    22      48     4.4      510   8  US-11-266-748A-107946   Sequence 107946,
c   23      48     4.4      510   8  US-11-266-748A-126491   Sequence 126491,
    24      47     4.4       64   7  US-11-511-035-532998    Sequence 532998,
    25      47     4.4      894   8  US-11-266-748A-175387   Sequence 175387,
    26      47     4.4     1162   8  US-11-266-748A-247578   Sequence 247578,
    27    43.2     4.0    64196   7  US-11-033-056A-38372    Sequence 38372, A
    28    40.2     3.7     1000   8  US-11-266-748A-198617   Sequence 198617,
    29    40.2     3.7     4681   9  US-11-218-305-1801      Sequence 1801, Ap
    30    39.8     3.7    38630   6  US-10-669-920-206       Sequence 206, App
c   31    39.2     3.6     3222   9  US-11-218-305-240       Sequence 240, App
    32    38.8     3.6      391   8  US-11-266-748A-179520   Sequence 179520,
    33    38.8     3.6      775   6  US-10-374-780A-1099     Sequence 1099, Ap
    34    38.8     3.6      775   6  US-10-374-780A-1102     Sequence 1102, Ap
    35    38.8     3.6      978   8  US-11-266-748A-352617   Sequence 352617,
c   36    38.8     3.6      978   8  US-11-266-748A-435996   Sequence 435996,
    37    38.8     3.6     1000   8  US-11-266-748A-392185   Sequence 392185,
c   38    38.8     3.6     1000   8  US-11-266-748A-482903   Sequence 482903,
    39    38.8     3.6     1205   6  US-10-374-780A-1101     Sequence 1101, Ap
    40    38.8     3.6     1205   6  US-10-374-780A-1104     Sequence 1104, Ap
    41    38.8     3.6     1253   9  US-11-218-305-8594      Sequence 8594, Ap
    42    38.8     3.6     2651   8  US-11-266-748A-57574    Sequence 57574, A
c   43    38.6     3.6     1979   6  US-10-612-783-3041      Sequence 3041, Ap
c   44    37.8     3.5     1439   6  US-10-449-902-6346      Sequence 6346, Ap
    45    37.8     3.5     3155   8  US-11-266-748A-350021   Sequence 350021,

ALIGNMENTS 



RESULT 1 

US-11-033-056A-36264 

Sequence 36264, Application US/11033056A 
Publication No. US20060292572A1 
GENERAL INFORMATION: 
APPLICANT: STUART, ROBERT O. 
APPLICANT: STUART, ELIZABETH DUFF 
APPLICANT: WACHSMAN, WILLIAM 
APPLICANT: MERCOLA, DANIEL 
APPLICANT: MCCLELLAND, MICHAEL 
APPLICANT: WANG-RODRIGUEZ, JESSICA
APPLICANT: TARIN, DAVID 
APPLICANT: BERRY, CHARLES C. 
APPLICANT: ARDEN, KAREN 
APPLICANT: WASSERMAN, LINDA 
APPLICANT: GOODISON, STEVEN 
APPLICANT: KLACANSKY, IGOR 

TITLE OF INVENTION: CELL-TYPE-SPECIFIC PATTERNS OF GENE EXPRESSION 
FILE REFERENCE: 15670-073001 
CURRENT APPLICATION NUMBER: US/11/033,056A
CURRENT FILING DATE: 2005-01-10 
PRIOR APPLICATION NUMBER: 60/535,382 
PRIOR FILING DATE: 2004-01-09 
PRIOR APPLICATION NUMBER: 60/536,163 
PRIOR FILING DATE: 2004-01-12 
NUMBER OF SEQ ID NOS: 38888
SOFTWARE: PatentIn version 3.3
SEQ ID NO 36264 
LENGTH: 39315 
TYPE: DNA 

ORGANISM: Homo sapiens 

US-11-033-056A-36264 



Query Match 73.1%; Score 789.8; DB 7; Length 39315; 

Best Local Similarity 99.7%; Pred. No. 2.5e-207; 

Matches 791; Conservative 0; Mismatches 2; Indels 0; Gaps 0;
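
The percentages on these lines appear to follow directly from the counts printed beside them. The sketch below is an illustration only: the function names are hypothetical, and both formulas are inferred from the figures in this report (Perfect score 1080, Score 789.8, Matches 791, Mismatches 2, Indels 0), not taken from GenCore documentation.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect (self-match) score."""
    return round(100.0 * score / perfect_score, 1)


def best_local_similarity_pct(matches: int, mismatches: int, indels: int) -> float:
    """Matching columns as a percentage of all aligned columns (assumed definition)."""
    return round(100.0 * matches / (matches + mismatches + indels), 1)


# Figures copied from RESULT 1 and RESULT 2 of this report.
print(query_match_pct(789.8, 1080))              # 73.1
print(best_local_similarity_pct(791, 2, 0))      # 99.7
print(query_match_pct(430.2, 1080))              # 39.8
print(best_local_similarity_pct(706, 183, 66))   # 73.9
```

The same two ratios reproduce the Query Match column of the summary table for every listed hit.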



Qy    288 TTTCAGGTTTTTGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 347
          |||  |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36342 TTTAGGGTTTTTGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 36401

Qy    348 AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 407
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36402 AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 36461

Qy    408 GTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGC 467
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36462 GTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGC 36521

Qy    468 ACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGA 527
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36522 ACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGA 36581

Qy    528 CCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCC 587
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36582 CCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCC 36641

Qy    588 CAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGC 647
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36642 CAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGC 36701

Qy    648 AGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAA 707
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36702 AGAATGTAGGGAGGAGCTGCTGAAAGATGACAGCTCTGAACACGGCGCACCCGACAGCAA 36761

Qy    708 AGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGT 767
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36762 AGAGAAGACGCCTGGGAGACATCGCCGCTTCACAGGTGACTCGGGCATTGAAGTGTGTGT 36821

Qy    768 GTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGC 827
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36822 GTGCAACCGGGGCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGC 36881

Qy    828 TCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGA 887
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36882 TCTGGATGGGCCCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGA 36941

Qy    888 GGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCC 947
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  36942 GGAAGGCCTCTGTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCC 37001

Qy    948 ACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCA 1007
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  37002 ACGGCCGCCCGCATGCCTGCTGCTGAACACCATCAACGAGCAGGACTCTCCCAACTCCCA 37061

Qy   1008 GAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAA 1067
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  37062 GAGCAGCAGCTCCCCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAA 37121

Qy   1068 CCAGGGTAGGGGA 1080
          |||||||||||||
Db  37122 CCAGGGTAGGGGA 37134





RESULT 2 

US-11-266-748A-59993/C
; Sequence 59993, Application US/11266748A
; Publication No. US20060134663A1
; GENERAL INFORMATION:
; APPLICANT: Harkin, Paul
; APPLICANT: Johnston, Patrick
; APPLICANT: Mulligan, Karl
; TITLE OF INVENTION: Transcriptome Microarray Technology and
; TITLE OF INVENTION: Methods of Using the Same
; FILE REFERENCE: 55815-0102 (319189)
; CURRENT APPLICATION NUMBER: US/11/266,748A
; CURRENT FILING DATE: 2005-11-03
PRIOR APPLICATION NUMBER: EP 04105479.2
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: EP 04105482.6
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: EP 04105483.4
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: EP 04105507.0
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: EP 04105485.9
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: EP 04105484.2
PRIOR FILING DATE: 2004-11-03
PRIOR APPLICATION NUMBER: US 60/662,276
PRIOR FILING DATE: 2005-03-14
PRIOR APPLICATION NUMBER: US 60/700,293
PRIOR FILING DATE: 2005-07-18
NUMBER OF SEQ ID NOS: 483996
SOFTWARE: PatentIn version 3.3
SEQ ID NO 59993
LENGTH: 35944
TYPE: DNA

ORGANISM: Homo Sapiens
US-11-266-748A-59993



Query Match 39.8%; Score 430.2; DB 8; Length 35944;

Best Local Similarity 73.9%; Pred. No. 7.6e-108;

Matches 706; Conservative 0; Mismatches 183; Indels 66; Gaps 10;


Qy 123 ACT CT GGT GGTT CT GGCT GGT GT GGACC AT CAT CAT CAT C CT GAGCTGCTGCTGTGT 179 

III I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I II 
Db 33820 ACTGCT GT GGCT CT GGCT GGTGTGGGCC AC CAT CAT CAT C CT CT T GAGCT GCT GCT T CGT 33761 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 
Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



180 TTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAA 2 39 
ill I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

33760 CT GCT GC C AC CGCT GAGC C AAGCACT GT CT T C AGGC C AGGC AGT GGC AAC AAGAAAT C AA 33701 

24 0 CCTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTAT-TTCAGGTTTT 2 98 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

33700 CCCGATCACCTGTAG-GAAGCCCATGATTACTCAGCACTGCCACTTTATCTTCAGTTTTT 33642 

299 TGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCC 358 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I HIE I I I I I I I I I I I I I EMM 

33641 TGCCAAACTATTTACTACCTTGTTATGAGGGAATGGTTTCCCGATCTCCAACTCTTCCCC 33582 



359 



418 



CACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACAGTGTGGCCCTG 
I I II I I I M I I I I I I I I I I III I I M M II I 

33581 AAAC AT AT AGT GACTT CC AGCT AT GGC A TGTGGCCCTT 33544 



419 



478 



CAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCC 
I I I I I I I I I I I Mill II I M I I M M I I II I II I M 

3354 3 AAGGTGGCAGTGAC CC T GGG ACT C CC AGGAGGC ACAGAGC AGCC 33500 



479 



33499 



CCTT GT CT GAGC CC AGCAGAAGC AGC AC AAGACC CC CAAGC AT C GCT G AC CCT GAT CCCT 538 

Mill III Ml I I II I I I I I I I I I I M I M I I I M III I I I I I I I I II III 

CCTT GGCT GGGCCTAGCAGAAGCAGCACAAGACCCCCACTCATTGCTGACCCTGGGGCCT 334 40 



539 



CTGACCTACCAGTTGACCGAGC AGCC AC CAAAGCCCCAGGGATGGAGCCC AGT GGCT CTG 598 
II I I I I I I II I I I I I I I I I I I I I M I I I I II M I I I I I I II I II II II 

33439 CT GAC — ACCAGCTGAC AC AGCAGC CACC AAAGACTT GGGAAT GGAGCC C AGT GTCT GT G 33382 

599 TGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGCAGAATGTAGGG 658 
I I I I I I M M II I I I I I I I I I I I I I I I I I M II I II I I I I I I II I I 

33381 TGGCCAGCCTGGGGGAGTTGGGCCGAGGAGCCTTCCTGTGCAAGGATTTAGAATATATGG 33322 

659 AGGAGCTGCT GAAAGATG AC AGCT CT GAAC AC GGCGC ACC CG AC AGC AAAGAGAAGAC GC 718 
I I I II I I I I II I M I I I I II II I I I I I II II I II 

33321 AGCAGCT GCT GAAT GAT T AC AACT CT GAGC AT GGC AGACTATCCCCGCCCCTC 33269 



719 



33268 



778 



779 



CT GGGAGAC AT CGC CGCTT C AC AGGT GACT CGGGC ATT GAAGT GT GT GT GT GCAAC CGGG 
I I I I I II I 1 I i II II I I I I II II II I I I 

CCCGCCCCCTTCGCCC CAAGC AAAGAGAAGAT AC C A TGTGTGC AACT GGG 33219 



838 



GCCACCATGACGATGACCTCAAAGAGTTCAACACACTCATCGATGATGCTCTGGATGGGC 
■ I I I I I I M I I I I I I I I I I I M II II I I I I I I I I J I II II I I I I I I I I II I 
33218 GCCAACATGACGATGGCCCCAAAGCATTCAACATGCTCATCAGTGATGCTCTGGATGGCC 33159 
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Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



839 CCCTGGACTTCTGCGACAGCTGCCATGTGCGGCCCCCTGGTGATGAGGAGGAAGGCCTCT 8 98 
I i I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I i INN li ill II 
33158 CC CT GG AT GT CC AT GAC AGCTGCC CT GT GC AGTC AC CT GGT G AC AAGG AGC AAT GC CC CT 33099 

899 GTCAGTCCTCTGAGGAGCAGGCTCGAGAGCCTGGGCACCCGCACCTGCCACGGCCGCCCG 958 
I III I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
33098 GCCAGCCATGCAAGGAGCAAGCTTGAGAGTCTGGGCATCTGCACCTACCACGGCCCCCAG 33039 

959 CAT GCC TGCT GCT G AAC A CC AT C AACGAGC AGGACT CT CCC AACT C CC AG AGC AGC AGCT 1018 
I I I I I I I I II I I I I I I I I I II III I I I I II I I II I II I I I I I I I I I I I I 
33038 — TCCCTGCTGCTGAGCACCATGGATGAACAGAACTCTCCCAACCCCTAGAGCAGCAGCT 32981 

1019 CCCCCAGCTAGAGCAGGTCCTGCCAGCACCCAGCAACTTGGCAAAGCAACCAGGG 1073 
I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
32980 — CCCAGCCAGAGCCGGCCCTGTCTGTGCCCAAGACCTTGGCAAAGCAACCAGGG 32928 



RESULT 3 

US-11-033-056A-37870/C 

Sequence 37870, Application US/11033056A 
Publication No. US20060292572A1 
GENERAL INFORMATION: 
APPLICANT: STUART, ROBERT O. 
APPLICANT: STUART, ELIZABETH DUFF 
APPLICANT: WACHSMAN, WILLIAM 
APPLICANT: MERCOLA, DANIEL 
APPLICANT: MCCLELLAND, MICHAEL 
APPLICANT: WANG-RODRIGUEZ, JESSICA
APPLICANT: TARIN, DAVID 
APPLICANT: BERRY, CHARLES C. 
APPLICANT: ARDEN, KAREN 
APPLICANT: WASSERMAN, LINDA 
APPLICANT: GOODISON, STEVEN 
APPLICANT: KLACANSKY, IGOR 

TITLE OF INVENTION: CELL-TYPE-SPECIFIC PATTERNS OF GENE EXPRESSION 
FILE REFERENCE: 15670-073001 
CURRENT APPLICATION NUMBER: US/11/033,056A
CURRENT FILING DATE: 2005-01-10 
PRIOR APPLICATION NUMBER: 60/535,382 
PRIOR FILING DATE: 2004-01-09 
PRIOR APPLICATION NUMBER: 60/536,163 
PRIOR FILING DATE: 2004-01-12 
NUMBER OF SEQ ID NOS: 38888 
SOFTWARE: PatentIn version 3.3
SEQ ID NO 37870 
LENGTH: 341983 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-11-033-056A-37870 



Query Match 39.8%; Score 430.2; DB 7; Length 341983;

Best Local Similarity 73.9%; Pred. No. 2.3e-107;

Matches 706; Conservative 0; Mismatches 183; Indels 66; Gaps 10;



Qy 123 ACTCTGGT GGTT CT GGCT GGTGTGGACC AT CATC AT CATCCT GAGCTGCTGCTGTGT 179 

III I I li I I I I I I I I I I i I I I III I I I I I I I I I I I 1 I I I I I I I I I I I II 
Db 318732 AC T GCT GT GGCT CT GGCT GGTGT GGGCC AC CAT CAT C AT CCT CT TGAGCT GCT GCTT C GT 318673 

Qy 180 TTGCCACCACCGCCGAGCCAAGCACCGCCTTCAGGCCCAGCAGCGGCAACATGAAATCAA 239 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 318672 CTGCTGCCACCGCTGAGCCAAGCACTGTCTTCAGGCCAGGCAGTGGCAACAAGAAATCAA 318613 

Qy 24 0 CCTGATCGCTTACCGAGAAGCCCACAATTACTCAGCGCTGCCATTTTAT-TTCAGGTTTT 298 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 318612 CCCGATCACCTGTAG-GAAGCCCATGATTACTCAGCACTGCCACTTTATCTTCAGTTTTT 318554 



Qy 

Db 

Qy 

Db 

Qy 



299 TGCCAAACTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCCAACTCCTCCCC 358 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
318553 TGCCAAACTATTTACTACCTTGTTATGAGGGAATGGTTTCCCGATCTCCAACTCTTCCCC 318494 

359 CACCAT ACAGTGCCTT CCAGCT ACAGCAGCAGCAGCTGCTGCCT CC AC AGTGTGGCCCTG 418 
I Mil I I I I I I I I I I I I I I III I I I I I I I II 

3184 93 AAAC AT AT AGT G ACTT CCAGCT AT GGC A TGTGGCCCTT 3184 56 



419 CAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGCACAGAGCAGCC 
I I II I I I I I I I Mill I I I I I I I I I I I I I I I I I I I I I 



478 
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